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PREFACE 


This investigation applied to Landsat data the advances 
and developments of the past decade in analyzing multispectral 
remote sensing measurements for crop identification and area 
estimation. Landsat MSS data for Kansas and Indiana were 
classified using computer-aided analysis techniques to identify 
and determine the areal extent and distribution of the major 
crops in the two state test area. It was conclusively demon- 
strated that Landsat data analyzed by computer methods could 
be effectively used to produce accurate estimates having 
extremely small sampling error. Recommendations are made for 
increasing the spectral, spatial and temporal resolution of 
data acquired by future satellite systems, along with pre- 
processing to geometrically correct and register data sets. 

It is recommended that attention be given to developing more 
effective methods of scene stratification and obtaining crop 
yield information from Landsat data. 

The rationale and background of the investigation are 
described in Section 1.0; the objectives follow in Section 2.0. 
In Sections 3.0 and 4.0 the test areas and experimental 
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approach and procedures are described. The results of the 
investigation are presented in Sections 5.0 and 6.0. The 
significant results and conclusions of fhe investigation are 
given in Section 7.0, followed by the recommendations in 
Section 8.0. 
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1.0 INTRODUCTION 


As our grain reserves become depleted and world popula- 
tion and demand for food increase, the need to improve the 
quality of world crop production information becomes ever 
more critical; Accurate and timely crop production informa- 
tion has been identified at the World Food Conference held 
in Rome in 1974 [25] and more recently in a National Academy 
of Science study [20] as a critical part of the solution of 
the food problem. 

During the past decade considerable evidence has devel- 
oped that multispectral remote sensing from aerospace plat- 
forms can provide quantitative data which can be effectively 
used to identify major crop species and determine their 
areal extent. Remote sensing techniques may prove to be a 
more accurate, precise, timely, and/or cost effective method 
of acquiring crop production information than conventional 
surveys carried out on the ground. The information gained 
from this investigation should provide additional data on 
which to determine the utility of remote sensing. 
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1.1 Value of Crop Production Information 


Most countries forecast and estimate their crop produc- 
tion, but relatively few have reliable methods for gathering, 
the necessary data. The benefits of improved crop informa- 
tion are: (1) accurate estimates result in price stability; 

(2) timely and accurate forecasts of production allow gov- 
ernments to plan domestic and foreign policies and actions; 
and (33 accurate forecasts enable optimal utilization of 
storage, transportation, and processing facilities. Con- 
versely, the socioeconomic costs of not having accurate and 
timely information available are substantial. 

The economic value of increased crop forecast accuracy 
in the United States was first quantified by Hayafni and Peter- 
son [12]. They estimated from their model that a reduction 
in forecast e_r_r_o_r for wheat from 3.2% to 2.1% would have 
annual net social benefits of 70 million dollars at 1968 
prices- -a figure which would be approximately doubled at 
1974-1976 prices i On a world basis the value of improved 
forecast would be substantially greater. Comparable bene- 
fits would be gained by improving the accuracy of estimates 
for other major crops. 

In addition, more frequent information, such as might 
be provided with remote sensing techniques, would increase 
the social benefits even without improvements in the crop 
estimate error [10]. 
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1.2 Conventional 'Crop Survey Methods 


Information gathering is as old as civilization. Census- 
taking by the Egyptian Pharaohs and Roman Emperors are 'good 
examples. However, the application of scientific statistical 
methods to gathering agricultural statistics is only about 
a hundred years old. But, in spite of many technological 
advances in the methods used to survey crops, many countries 
still do not have adequate systems to gather data needed to 
support” satisfactory decision making about food and nutri- 
tion. 

The system developed in the United States is regarded 
as. being one of the most comprehensive and accurate. In 
this country the Statistical Reporting Service of the 
Department of Agriculture (USDA/SRS ) has. responsibility for 
collecting and reporting current data on U.S. agriculture. 

The present program of crop and livestock estimation annu- 
ally includes over 500 national reports, plus numerous 
reports issued by individual states. Reports are made for 
more than 120 crop commodities (including field and seed 
crops, vegetables, fruits, and nuts) and provide estimates 
of acreages farmers intend to plant; acreages actually 
planted and harvested; yield, production arid crop disposi- 
tion; as well as periodic indications of remaining stocks 
for importarit crops. Monthly forecasts of production are 
prepared for major crops throughout the growing season. 
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Nearly all surveys conducted by SRS are probability 
surveys based on relatively small samples. Since 1965 a 
national general purpose survey including 17,000 area seg- 
ments which are enumerated during May and June each year 
has been used. The sampling units or area segments are 
typically about 2.6 square kilometers (about one square 
mile) in size. This sample is stratified with states and 
areas within states serving as strata. Crop reporting dis- 
tricts (CRD), groupings of contiguous counties having sim- 
ilar-agricultural practices, are generally the intrastate 
strata. Sample selection within strata follows a system- 
atic approach using a geographically arranged listing of 
the sampling frame. Trained enumerators visit each seg- 
ment and interview each farm operator to obtain data on 
crop acreages,- livestock production, production costs,, 
and prices received. About 20% of the questionnaire con- 
cerns crop acreage information. Additional information 
describing the SRS sampling and estimation procedures may 
be found in references (23] and [ 7 ] . 

The current SRS probability surveys provide indepen- 
dent estimates with known measures of precision (sampling, 
errors) . Typical sampling errors for several major crops 
are shown in Table 1. It should be noted here the SRS 
surveys are designed to produce accurate, precise estimates 
at the national level. At the state level where there are 
generally 300-400 sampling units, the sampling error is 
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Table 1. Coefficients of variation from June Enumerative 

* and Objective Yield Surveys in the United States, 
1975. 


Crop 

Coefficient 
Acres Planted 

of Variation C 
Yield 

Production 

Winter Wheat 

1.5 

1.0 

2. 0 

Corn 

1.1 

0.9 • 

1. 7 

Soybeans 

3.5 

1.0 

2.1 

Cotton 

3.5 

1.0 

3.7 


a From Caudill [ 7 ] . 
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greater; coefficients of variation are typically 4-6%, 


Estimates for counties are not obtained from the June 
enumerative survey since there are too few segments per 
county to be reliable. Rather, the estimate of the total 
acreage of, for example, wheat in the state is obtained 
and then subdivided among counties. The county allocations 
are based on a mail survey which may include 50-100 respon- 
dents per county and/or the last agricultural census. Var- 
iance estimates are not calculated by the SRS for county 
estimates, but the coefficients of variation are believed to 
be on the order of 10% or more. 

1.3 Development of Remote Sensing Technology for Crop Surveys 

To understand the approach used and results from this 
investigation it_will be helpful to briefly review the devel- 
opment of remote sensing technology related to crop surveys. 
This historical perspective will indicate the progress which 
has been made and the contribution of this investigation. 

Remote sensing from satellites is particularly appro- 
priate for crop surveys because of the capability to obtain 
repetitive coverage of wide areas. The physical basis for 
remote sensing, data acquisition platforms and sensors, and 
data analysis techniques are described by Bauer [ 3 ] in a 

review of the potential role of remote sensing in determining 

» 

the distribution and yield of crops. 
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Remote sensing as it is known today is an outgrowth of 

• f 1 . 

aerial photography. Although the use of aerial photography 
has heen developing for more than a hundred years, remote 
sensing has been evolving and expanding most rapidly since 
1960 as new sensors and interpretation techniques became 
available .’ 

In 1964, mu-ltispectral photography was collected for 

the first time over agricultural fields, and the potential 

of the multispectral approach to crop identification was 

recognized [13] . After this approach was further defined, a 

crop classification was made from multispectral scanner data 

in 1967, using pattern recognition methods implemented on a 

* 

digital computer [17]. 

One of the first investigations using satellite-acquired 
imagery to identify crops was performed by Anuta and 
MacDonald [2]. Apollo-9 multispectral photography was digi- 
tized and analyzed using computer- implemented pattern recog- 
nition techniques. Agricultural land in the Imperial Valley 
of California was ' accurately classified into several individ- 
ual crops, soil, and water. 

The Corn Blight Watch Experiment, conducted in 1971 by 
NASA, US DA, Purdue University, 'and the University of Michigan 
in seven Corn Belt states, provided a prototype remote sens- 
ing system [18]. It successfully integrated techniques of 
sampling, data acquisition, storage, retrieval,- processing, 
analysis., and information dissemination in a quasi- operational 
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system environment. The results showed that remote sensing 
could be used to ^quantitatively ' recognize corn leaf blight, 
as well as other agricultural crops and land uses over 
broad areas. 

The supply of remotely sensed data greatly increased 
with the launch of Landsat-1 '(formerly called the Earth 
Resources Technology Satellite or ERTS-1) in 1972, From an 
orbit 912 km above the earth, the satellite can complete a 
full observation of the earth every 18 days. Its multispec- 
tral imagery is collected in four visible and infrared wave- 
length bands over 185 km wide passes over the earth. This 
newest data source with its synoptic view of earth has opened 
a whole new dimension to the capability to obtain information 
about earth resources. 

Bauer and Cipra [4 ] used multivariate pattern recogni- 
tion methods implemented on a digital computer to classify 
Landsat-1 data acquired over a three- county area in northern 
Illinois. The classification ' of the Landsat data, as mea- 
sured by an independent sample of test fields, was 85% accu- 
rate on a point by point basis '(Table 2). Although there 
were errors in the classification of individual data points, 
area estimates made oveT the three- county area were within a 
few percent of those made by the U.S. Department of Agricul- 
ture (Table 3) . 
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Table 2. Classification of corn, soybean, and "other" test 
fields by computer-aided analysis of Landsat-1 
multispectral scanner data for DeKalb County, 

. Illinois. 


Crop 

Number 

of 

points 

Number of points classified as 

Percent 

correctly 

classified 

Corn 

Soybeans 

"Other" 

Corn 

3968 

3367 

357 

244 ■ 

85 

Soybeans 

1113 

115 

855 

133 

77 

"Other" 

295 

16 

50 

234 

79 


5376 

3498 

1262 

611 

83 


a From Bauer and Cipra [ 4 ] . 


Table 3. Comparison of area estimates made by U.S. Department 
of Agriculture and from classification of Landsat-1 
multispectral scanner data for DeKalb, Ogle, and 
Lee . Counties , Illinois. a 



Percent 

of total area 

Crop 

US DA 

LANDSAT 

Corn 

40.2 

39.6 

Soybeans 

18.0 

17.8 

Other 

41.8 

42.6 


From Bauer and Cipra [ 4 ] . 
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2 . 0 OBJECTIVES 


The long term objective of this investigation is to 
develop and test procedures utilizing Landsat data' to not- only 
identify, but more importantly, determine the areal extent 
and distribution of earth surface features over large geo- 
graphic areas. The specific applications selected for this 
investigation are crop identification and area estimation for 
two states in the Central United States. 

There is high probability that improved crop production 
information, long recognized as a potential application of 
remote sensing, can be obtained from Landsat data. The wide 
area coverage of— Landsat, linked with computer processing, 
offers a unique opportunity to improve upon the sampling 
methods now used for making area estimates from ground-based 
systems. This is particularly true as the size of the area 
decreases, e.g. state, district, county. Further, the sequen- 
tial coverage of Landsat should lead to improvements in the 
timeliness of the estimates. Both of these aspects would re- 
sult in economic and social benefits. 
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The specific objectives of this study are: 

Using Landsat data and computer- implemented 
pattern recognition, classify the major crops 
s from regions encompassing different climates, 

t soils, and crops. 

Estimate crop areas for county and state size 
areas using the crop identification data ob- 
tained from the Landsat classifications. 

’ i 

Evaluate the accuracy, precision, and timeli- 
ness of crop area estimates obtained from 
Landsat data. 

Two important underlying premises to be tested in the 
investigation are: 

The synoptic view of Landsat provides the 
opportunity to obtain crop production 
information over large areas, e.g. states 
and countries. 

By using computer- implemented data analysis 
to classify pixels distributed over entire 
, counties, it is also possible to- make accurate 

and precise estimates for local areas, e.g. counties 

The successful accomplishment of the investigation would 


contribute to the development of earth resources surveys by: 

Leading to operational use of satellite data 
for obtaining crop area estimates . 

- • Refining techniques which could also be 

applied to other problems such as crop yield 
forecasts, natural resource inventories, and 
measurement and monitoring of damage caused 
by floods, drought, insects and disease. 

Developing improved methods of obtaining 
necessary ground truth . 

Testing statistical sampling models designed 
specifically for remote sensing applications. 

Providing data for determining needed 
information on costs and benefits of 
obtaining information using remote sensing. 
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3.0 SELECTION AND DESCRIPTION OF TEST AREAS AND CROPS 


Kansas and Indiana were selected as the test states for 
this investigation. Winter wheat in Kansas and corn and soy- 
beans in Indiana were selected as the crops for which area 
estimates would be made from classifications of Landsat data. 

The test areas and crops were selected to sample the 
range of conditions which are present in the Great Plains and 
Corn Belt regions of the United States. The selections of 
test areas and crops were made taking into account the spec- 
tral and spatial parameters of the Landsat data and the charac- 
teristics of crop production. On the "spectrum of difficulty", 
wheat identification in Kansas is undoubtedly an easier problem 
than corn and soybean identification in Indiana. That is, the 
Landsat data is likely to be more adequate for winter wheat 
identification in Kansas than for corn and soybean identifica- 
tion in Indiana. 

Winter wheat is the first crop to "green-up" in the 
spring, has the greatest amount of green biomass (except for 
alfalfa) during the April to mid-June period, and at maturity 
in late June and early July is the only cover type 'which is 
golden-yellow in color. In other words, during much of its 
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growth cycle .it is dissimilar from the other cover types 
present. Additional factors simplifying the task of wheat 
identification and area estimation in Kansas is that wheat is 
grown in relatively large. fields , on a large percentage of the 
agricultural land, and with relatively few other cover types 
and crops present. 

In comparison, corn and soybeans in Indiana. are warm 
season or summer crops .which are green at the same,, time as 
many other cover types present during the summer in- Indiana-. 
Some of the possible - "confusion" cover types .include trees, 
pasture, forage crops, and oats. Secondly, field sizes in 
Indiana are much smaller than in Kansas. This is due .to the 
greater heteorogeneity in soils and the greater number of 
crops being grown.- The smaller field sizes .cause a greater 
fraction of pixels to fall on .field boundaries and include 
more than one c ove r type. In summary, -corn and soybeans .in 
Indiana are more like the classes they are to be' discriminated 
from than is the case with winter wheat In Kansas. 

.Kansas is the number one wheat producing .state in the 
nation [16]. ' Its wheat production for 1975 "totaled 9.6 million 
metric tons (3^1 million bushels) , .10% above 1974 and second 
only to the record 10.5 -million metric tons (385 million 
bushels) produced in 1973. The 1975 crop was seeded on 5.2.' 
million hectares (12.8 million acres), 7% more than a year 
earlier. Area harvested for grain, at 4.9 million hectares 
(12.1 million acres), was 4% above the previous year. 
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Abandonment, at 5.5%, was slightly above recent years but 
well within normal rates of abandonment. The average yield of 
19.5 quintals per hectare (29 bushels per harvested acre) was 
1.0 quintal (1.5 bushels) above the 18. 5. quintal (27.5 bushel) 
average in 1974. The distribution of wheat production in the 
state is shown in Figure 1. The farm value of the 1975 wheat 
crop in Kansas was 1.2 billion dollars. 

Kansas soils were developed under mixed or short prairie 
grass vegetation. Average precipitation varies -from 38 
centimeters (15 inches) in the west to. 81 centimeters 
(32 inches) in the east. The climate is continental in most 
of the state, becoming semi-arid in the west. The distribution 
and amount of precipitation during the year fit the requirements 
of winter wheat better than any other crop in much of the state. 
Other important crops grown include corn, grain sorghum, and 
alfalfa. The amount of irrigated land is increasing each year. 
There were 20.2 _ million hectares (49.9 million acres) of land 
in farms in 1975-; crops were harvested from 12 million hectares 
(30 million acres) . 

In 1975 Indiana ranked third among the states in both 
corn and soybean production [15]. The 2.3 million hectares 
(5.6 million acres) of qorn harvested was a record high. The 
average corn yield was 59 quintals per hectare (98 bushels 
per acre). Production at 13.5 million metric tons (552 million 
bushels) was the second largest crop on record. The area in 
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WHEAT — Bushels Produced by Counties 1975 

Rank of First Ten Counties Shown by Number Within County 



Figure 1. The distribution of 1975 wheat production in Kansas. 
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soybeans was 1.5 million hectares (3.6 million acres), 7% below 
the previous year. The 20.7 quintal (33 bushel) average yield 
was a record high and total production of 3.0 million metric 
tons (120 million bushels) was the second greatest ever. The 
distributions of Indiana corn and soybeans are shown in 
Figure 2. 

Indiana includes both glacial and non-glacial soils, with 
topography ranging from the nearly level prairies of northern 
and central parts of the state to the rolling and steep lands 
of the southern areas of the state. Both dark colored soils 
developed under prairie vegetation and light colored soils 
developed under forest are present. The climate is typically' 
continental with cold winters, warm summers, and frequent 
short period fluctuations of temperature, humidity, cloudiness, 
and wind direction. The well-distributed annual precipitation 
of 81 to 102 centimeters (32 to 40 inches) favors high 
agricultural production. Sunshine averages more than 70% of 
its possible duration for the summer months and summer precipi- 
tation occurs mostly during short duration showers or thunder- 
storms . 
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4.0 EXPERIMENTAL APPROACH AND PROCEDURES 


The approach used in the investigation huilt on- proce- 
dures developed and utilized in previous research at LARS 
with the objective of extending them to larger areas. The 
procedures were developed upon five fundamentals which were 
determined early 'in the investigation: 

The classifier would be trained and tested using 
aerial photography as reference data. 

Counties without reference data would be classi- 
fied using training statistics from an adjacent 
county having similar crops and soils and lying 
in the same Landsat frame. 

Area estimates would be made from a systematic 
random sample of pixels distributed over the 
entire_county . 

Area estimates would be made on a county basis 
and aggregated to district and stbte levels. 

Estimates would be adjusted for classification 
bias. 

The implementation of the basic steps is illustrated in 
Figure 3. The remainder of this section describes in detail 
the procedures used in the investigation. 
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4.1 Acquisition and Selection of Landsat Data 


At the beginning of the project a standing order was 
placed with the EROS Data Center for Landsat- 2 photographic 
imagery over Kansas and Indiana. . The imagery was the basis 
for decisions of the choice of scenes to be used for classi- 
fication. If a- scene was chosen for use, the bulk computer 
compatible tape was then ordered retrospectively. Landsat-2 
was the primary source of multispectral scanner (MSS) data, 
with Landsat- 1 scenes being used only to complete the cover- ' 
age tor tne Southwestern Crop Reporting District ( CRD)’ in 
Kansas . 

The selection of a Landsat frame to classify for a 
given county was based upon the date of the Landsat data, the 
location of ground truth, and the amount and location of 
cloud cover. The desired attributes were that the crops of 
interest were spectrally discriminable at the time of the 
Landsat pass; aerial photography was available over areas 
similar in crop stage and soils in the same frame; and both 
the county to be classified and the training areas were not 
obscured by clouds or bad data. 

The Landsat frames chosen for the analysis in Kansas 
and Indiana are shown in Figures 4 and 5, respectively. The 
amount’ of cloud cover created a serious problem for. obtain- 
ing data for much of Indiana and northeastern Kansas. As a 
result, satisfactory data was not available for the Northeast 


20 



1 


3 



Key 



Landsa't Scene 
ID 

LARS Run 
Number ‘ * 

Date 

1 

2165-16450 

75013800 

July 6 

2 

■ 2146-16392 

75005800 

June 17 

3 

2163-16334 

75006500 

July 4 

4 

•2165-16453 

75004600 

July ' 6 

5 

2146-16395 

75005900 

June 17 

6 

2163-16340 • - 

75006600 

July 4 

7 

2144-16282 

75005600 

June 15 

8 

2147-16460 

75006200 

June 18 

9 

5032-16310 

75007200 

May 21 

10 

2073-16342 

75001500 

April 15 

11 

2109-16341 

75005000 

May 11 

12 

20-72-16284 

75000900 

April 9 

13 

2144-16284 

75005700 

June 15 

14. 

2107-16225 

75004900 

May 9 

15 

2142-16171 

75005400 

June 13 


Figure 4. Landsat Coverage for Kansas. 
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Landsat Scene 
ID 

LARS Run 
Number 

Date 

I 

2228-15515 

75009100 

September 7 

2 

2228-15522 

75009200 

September 7 

3 

2209-15464 

75009000 

August 19 

4 

2173-15480 

75008700 

July 14 

5 

2208-15405 

75010000 

August 18 

6 

2208-15412 

75010100 

Augus t 18 


Figure 5. Landsat Coverage for Indiana. 
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and East Central CRDs In Kansas. In Indiana, the only dis- 
tricts that ha'd complete Landsat coverage were the Northwest- 
ern, West Central, Central and East Central. . 

Tables 4. and 5 illustrate the- cloud cover problem. The 
standing order for Landsat-2 photographic imagery requested 
scenes that- contained less than 50% cloud cover. Since a 
low cloud cover- percentage does not necessarily mean that a 
scene is usable, for analysis, the number of usable scenes is 
specified in Tables 4 and 5. For example, a frame could be 
half in Indiana and half in Illinois. If the frame has 10-20 
cloud cover but the clouds cover the Indiana portion of the 
frame, it is unusable. Or, if there are three or four large 
cloud patches -which occur as long streaks across the frame, 
the frame is unusable even though the cloud cover may have 
only been 20%. The magnitude of the cloud cover problem is 
indicated in the tallies of data acquired and data used which 
show that only 21 out of 93 frames in Kansas and only eight 
out of 40 in Indiana were usable. 

In Kansas.;, -there was -April- da t a~"a v a i 1 abl e to cover the 
entire south central CRD and data in May and June to provide 

l 

duplicate coverage for ten of' the thirteen counties. It was 
decided to analyze these ten counties twice and compare the 
results. Figure 4 indicates which counties were analyzed 
twice and which frames and dates were used. In the statis- 
tical analysis of the results for Kansas, both dates, ^were 
used* for most of the statistical tests. However, the tables 
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Table 4. Summary of acquisition and usability of Landsat-2 
data for Kansas, April 1 - July 17, 1975. 


Month 

No. Frames 
Acquired by 
NASA/GSFC 

No. Frames 
Received from 
EROS Data Center* 

No. 

Usable 

Frames 

April 

29 

8 

6 

May 

28 

9 

2 

June 

18 

15 

9 

July 

18 

9 

4 

Total 

93 

41 

21 

*Standing order 

for all frames 

with < 50% cloud 

cover. 


Table 5. 

Summary of acquisition and usability of 
data for Indiana, July 1 - September 7, 

Landsat- 2 
1975. 

Month 

No. Frames 
Acquired by 
NASA/GSFC 

No. Frames 
Received from' 
EROS Data Center* 

No. 

Usable 

Frames 

July 

14 

11 

2 

August 

16 

7 

4 

September 

10 

6 

2 

Total 

40 

24 

8 


*Standing order for all frames with < 50% cloud cover. 

« 
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in sections 5.2 to 5.3 display figures only for the second 
date for these ten counties since the second date was closer 
to the time the wheat was harvested. The estimates made at 
harvest time are more important since the SRS estimates for 
area harvested were used for comparison of results. 

4.2 Acquisition of Aerial Photography 

A critical part of the entire investigation involved the 
reference or "ground truth" data set to be utilized in con- 
junction with the computer-aided analysis of the Landsat MSS 
data. Reference data was required for training the classifier 
and 'to test the accuracy of classification. Detailed crop 
type maps, do not exist because the crop grown in an individual 
field generally changes each year. And, indeed some field 
boundaries are changed from year to year. ' Therefore, current 
reference data . sets . had to be acquired to support the planned 
Landsat data analysis. 

In many previous agricultural .remote sensing experiments", 
reference data were obtained by on- the-ground identification 
and recording of crop type and other information by the 
researchers or local. USDA personnel, .But, the amount of data 
which can be obtained in this way is restricted by the time 
and personnel available and generally can be done for only a 

few relatively small areas. Resources were not available to 

* 

implement such an effort, even using sampling, for two 
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entire states. 

During the CITARS project conducted by NASA/ JSC , LARS,- 
and ERIM, this type of .ground observations was supplemented 
by interpreting aerial color infrared photography acquired 
concurrently and over the same area as the ground observa- 
tions [ 5 ]. The accuracies of crop identification by photo- 
interpretation routinely exceeded 95% and the data were 
successfully used for training and test purposes. It was 
therefore decided to take this approach one step further 
and make aerial photography the primary reference data source 
to identify and locate samples of wheat, corn, soybeans, and 
other cover types in .the Landsat data. 

After studying soil, climatology, and land use maps, 
flightlines were selected throughout each state to sample the 
variation in soils, land use, and crops. The flightlines 
were oriented north-south following major, highways, in Kansas 
and Indiana so that the aerial photography and Landsat data 
could be coordinated easily. 

A 70 mm Hulcher two-camera system was used with color 
infrared and color transparency film. The average ground 
speed was 275 km per hour and photographs were taken, with 
both cameras, at intervals of 38 seconds, producing a contin- 
uous strip of imagery with an overlap of 25-30%. The average 
altitude for each flight mission was 3,000 meters. The 
approximate scale of the photography was 1:80,000. Each frame 
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of aerial photography included an area roughly, four kilometers 
square (2.5 x 2.5 square miles). Examples of the photography 
are shown in Figures 10 and 11. 

In Kansas, aerial photography was acquired on April 29-30 
and June' 26-27, Both dates were quite adequate for different! 
ating wheat from all other cover types. The June mission 
covered the eastern counties (and some western counties) while 
the April one covered the rest of the state (Figure 6),. 

The flightlines and dates of aerial photography acquisi- 
tion for Indiana are shown In Figure 7. The May photography, 
when used concurrently with the July or August phonography, 
helped to differentiate corn and soybeans from all other 
fields . 

4,3 Digitization of Coordinates 

The Lands at coordinates for county boundaries were needed 
in order to make county crop estimates. In addition, three 
to eight points were needed along the flightline in a county 
in order for the analyst to match a computer map* of Landsat 
data to the aerial photography. To 'find coordinates., the* 
following procedure was used: 

1. Determine which counties are contained in the 
Landsat scene. 

2. Locate 25-30 checkpoints in the Landsat scene. 

3. Digitize- these .checkpoints on a 1:250,000 USGS map. 
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Figure 6. Kansas aerial photography flightlines and dates of photography acquisition. 
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4. 


Digitize points defining county boundaries. 

5. For each county that has aerial photography, 
digitize three to eight points along the flight- 
line. 

6. Use a bivariate quadratic regression routine to 
fit coordinates of the checkpoints from the 
Landsat scene to the corresponding coordinates 
on the USGS maps. Then calculate Landsat coor- 
dinates for points defining county boundaries 
and checkpoints along the flightline. 

7. Record the Landsat coordinates , for county bound- 
aries, and mark the Landsat coordinates for 
flightline points on the county maps. 

In the following paragraphs each of the steps is described 

further. 

The outlines of the state and all the county .boundaries - 
are displayed on a digital display device. Using the lati- 
tude and longitude for the Landsat scene center, the outline 
of the scene can be superimposed. A photograph taken of 
this image aids in determining which counties are covered. 

In order to locate checkpoints, the data was displayed 
one channel at a time, in 16 gray levels. Twenty- five to 
30 checkpoints were found, generally at the intersection of 
two highways, and the Landsat coordinates of these- points 
were recorded. 

The' (pc, y) coordinates of the checkpoints found in the 
Landsat scene, the points defining the county boundaries, 
and additional checkpoints along the flightlines are obtained 
from USGS 1:25 0,000 scale maps. A regression routine' was 
used to fit the Landsat checkpoints to the checkpoints 
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digitized from the USGS maps. The Landsat coordinates of 
the county boundaries and additional points along the flight- 
lines were then listed, and recorded on maps (Figures 8 and 9) . 
The Landsat coordinates of the county boundaries were later 
used for tabulating county classification results. The 
coordinates of the points along the flightlines were used 
by the analysts to locate the flightlines in' the Landsat 
data. 

4.4 Interpretation of Aerial Photography 

Large scale aerial photography was used as reference data 
following the assumption that the crops of interest could be 
readily and accurately identified. Standard photointerpre- 
tation techniques were used to identify fields of wheat and 
nonwheat in Kansas and fields of corn, soybeans, and ’’other" 
in Indiana. The coordinates of the identified fields were 
then located in Landsat data. Wheat was relatively easy to 
identify in Kansas; corn and soybeans were more difficult 
to identify, in Indiana. Fields which were no.t positively 
identified were not included as either training or test fields. 
Problems in photo interpretation , therefore, resulted in smaller 
training sets rather than inaccurate identification. Two 
general problems, clouds or haze and improper film exposure, 
were occasionally encountered, but did not seriously affect 
the photointerpretation process . ‘ 
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of Landsat coordinates of county boundaries. 
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County map showing aerial flightline and Landsat 
coordinates of points along it (Harvey County, Kansas) . 



Examples of the aerial photography over Kansas and 
Indiana are shown in Figures 10 and 11, respectively. These 
figures illustrate scale, quality, and appearance of major 
cover types. The difference in the number and size of fields 
in a section of land in the two states is also illustrated. 
4.4.1 Kansas Wheat 

Photography acquired on April 30, 1975, was used as ref- 
erence data for all of Kansas except the Southeast CRD. On 
this date the wheat fields had nearly total ground cover 
and were light green compared to alfalfa or clover and wheat 
during May. Clover and alfalfa were the only other crops 
achieving full ground cover and a bright green color at this 
time in the season. Confusion of wheat with these crops was 
occasionally a problem, but generally clover and alfalfa were 
brighter red on the color infrared film and could be discrim- 
inated from wheat. The planting patterns in wheat fields 
also helped in its identification. Pastures could usually 
be easily separated from wheat fields in the infrared photo- 
graphy. Color infrared photography was used exclusively for 
this date. 

Photography of June 26-27, 1975, was used for a limited 
area in the southeast part of the state. By this date, winter 
wheat was mature and harvest was ready to begin. Thus, with 
the straw dead, the wheat fields are golden yellow, a color 
which readily separates them from any other major feature 
present at this time. Primarily the Ektachrome color positive 
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Figure 10. Examples of color infrared and color aerial 

photography acquired over Finney County, Kansas 
on April 20 and June 27, 1975, respectively. 
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Example of color infrared photography 
acquired over Wayne County, Indiana on 
August 20, 1975. 


Figure 11. 
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images were used for the interpretation at this date, since 
the wheat fields could be easily identified on it. 

4.4.2 Indiana Corn and Soybeans 

Almost complete coverage of the Indiana flightlines was 
achieved on May 27, 1975, but corn had not yet emerged and 
soybeans may not even have been planted at this time. Photo- 
graphy from this date, however, was useful in separating 
corn and soybean fields from other fields since corn and 
soybeans are the primary crops appearing as bare soil at this 
time . 

The quality of the photography taken in July over Indiana 
was generally poor; there was a hazy overcast and the film was 
often overexposed. On the infrared film, corn fields appeared 
deep red and were confused with pasture. This photography was 
used only in conjunction with photography from another date. 

During the period from August 20 to September 6, 1975, 
corn fields are tasseled, thus their green color as viewed 
from the air is not as intense. These fields are therefore 
easily separated from the soybean fields, which are at a full 
leaf stage, and have a uniform deep green color. Corn fields 
also exhibit more texture than most other cover types. This 
was the optimum period for obtaining photographic data over 
Indiana during 1975, and it was more extensively used as 
reference data than any of the other time periods. Only the 
color infrared images were used since soybean fields appeared 
as a bright red, and corn fields were of a less intense red 
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4.5 Analysis of Landsat Data 

The Landsat data analysis techniques used in the inves- 

c 

tigation utilized the LARSYS Version 3 multispectral data 

analysis system. LARSYS is the software system, an inte- 

► 

grated set of computer programs, for analyzing remote sensing 
data developed by -Purdue/LARS during the past decade. The 
pattern recognition concept utilized in LARSYS represents a 
powerful and quantitative methodology for accommodating the 
multivariate nature of remote sensing data.. While the LARSYS 
approach takes full advantage of modern computer technology 
for data processing, man is an indispensable part of the 
analysis process. Thus., the techniques are better described 
as "computer- assisted" rather than "automatic"'. The process- 
ing functions of LARSYS are shown in Figure 12. Its theoret- 
ical basis and details of the algorithm implementation are 
described in references [24] and [22], respectively. 

In utilizing the LARSYS software for analyzing multi- 
spectral scanner data, one normally follows a procedure that 
involves: C 1) defining a group of spectral classes (training 
classes) ; (2) specifying these to a statistical algorithm 
which calculates a set of defined statistical parameters; 

(3) utilizing the calculated statistics to "train" a pattern 
recognition algorithm; (4) classifying each data -po'int within 
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Figure 12. Analysis functions of the LARSYS software system. 
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the data set of interest (such as part of a Landsat frame) 
into one of the training classes; and finally’ (5) displaying 
the classification results in either map or tabular format 
(or both) , according to the specifications of the application. 

During the past few years, experience at LARS has shown 
that there are many possible refinements in the methodology 
utilized by the analyst for obtaining training classes, while 
the rest of the procedure does not vary much from one analysis 
task to another. The most common techniques for defining 
training classes involve the so-called ''supervised’' approach, 
and the "unsupervised" or "clustering" approach. 

In the "supervised" approach, the analyst selects fields 
of known cover types and specifies these to the computer as 
training fields, using a system of (x,y) coordinates. The 
statistics are obtained for all categories of cover type in 
each area to be classified. The data are then classified 
and the results evaluated. Because the analyst had defined 
specific areas of known cover types to the computer, such 
classifications are referred to as "supervised". 

The second method uses a clustering algorithm which 
divides the entire area of interest into a number of spectrally 
different classes. The number of spectral classes into which 
the data will be divided must be specified by- the analyst. 

The spectral classes defined by the clustering algorithm are 
then used to classify the data, but at this point the analyst 
does not know what cover type is defined by each of the 
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spectral classes. After the classification is completed, 
the analyst will identify th.e cover type represented by each 
spectral class using available reference data or cover type 
maps. Because -the analyst does not need to define particular 
portions of the data for use as training fields, but must 
only specify to the computer the number of spectral classes 
into which the data is to be divided, a classification using 
this procedure is referred to as "unsupervised". 

Additionally, several variations of these basic methods 
for defining training classes are possible. One is to select 
training areas of known cover type (a supervised approach up 
to this point) , but then utilize the clustering algorithm to 
refine the data into unimodal spectral classes for each cover 
type. This is called a "modified supervised" approach and is 
the approach which was used in this investigation. 

The remainder of this section describes- the analysis 
methodology and additional details of the training procedure. 
An overview of the steps in the analysis sequence is shown 
in Figure 13. 

4.5.1 Selection of Training Bata 

The accuracy of classification results is highly depen- 
dent upon the training data. Selection of training^areas was 
based on two factors: first, the amount and quality of refer- 
ence data (aerial photography) available, and second, the 
presence of a representative sample of cover types of the 
area(s) to be classified. To insure that the best 
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Figure 13. Flowchart of procedures used in 
analysis of Landsat data. 
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classification accuracy is obtained, a sample of every 
spectral class of each cover type should be included in one 
or more of the training areas. This provides a reasonably 
representative training set to the classification algorithm. 

The analyst's first task was to gather and coordinate 
the information available about the county or counties to be 
analyzed. The Lands at scene had been selected (see Sec. 4.1) 
and the Landsat coordinates for each county boundary had 
been found. (see Sec. 4.3) . In addition, county maps had 
been prepared showing the Landsat coordinates of the check- 
points along the aerial photography flightline (/Figure 10) . 

The frame numbers of the aerial photography for each county 
were marked on the map. From this information, the analyst 
could determine the areas in the Landsat data corresponding 
to frames of aerial photography and then select the areas to 
be used for training the classifier. 

Training areas of 100 lines and 100 columns (approxi- 
mately 8 x 5.5 km) of Landsat data were selected in areas 
corresponding to aerial photography. For smaller counties, 
especially in Indiana, three to five training areas were 
chosen covering the entire flightline. In Kansas, four to 
six areas were selected with a.t least one in both the north- 
ern and southern portions of the county in order to adequately 
represent the. variation present in the county. 

To facilitate locating agricultural fields in the Landsat 
data,, a spectral class map was, produced by clustering each 
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training area. The clustering algorithm implemented in 
LARSYS finds natural groupings in the spectral data utilizing 
all four wavebands. Generally six' to eight classes were 
sufficient to provide an image on which the crop fields were 
readily identifiable. This approach was found to' be more 
satisfactory than working with gray scale maps of a single 
spectral band. 

Examples of cluster maps are shown in Figures 14 and 15; 
the color infrared photographs of .the same areas were shown 
in Figures 10 and 11. The cluster maps were matched with the 
corresponding frames of aerial photography, and roads, towns, 
and field boundaries were sketched on the cluster maps. 

Fields were marked on the cluster maps and thei-r cover 
type identified from the aerial photography. During the 
photointerpretation process, the analyst became- familiar with 
the variation in wheat, corn, soybeans, and other fields. 

Training fields had to meet three criteria. First, the 
cover type of the fields selected for training had to be posi 
tively identified by the photo-interpreter. Secondly, the 
fields themselves must be of only one cover type; for example 
if a ditch ran through the field, the analyst would avoid 
the ditch and select samples on either side of it. Thirdly, 
the training fields must adequately represent the variation 
present in the cover types throughout the area to be classi- 
fied; to insure this, the fields -were geographically dis- 
persed throughout, the flightline. The Landsat coordinates 
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Figure 14. Example of cluster map used for location and 

identification of fields in Finney County, Kansas. 

(W = wheat, A = alfalfa, £S = bare soil', P = pasture) 
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Figure 15. Example of cluster map used for location and 
identification of fields in Wayne County, 
Indiana. (C = corn, S = soybeans, 0 = other) 
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of field center (non- boundary) pixels were then obtained and 
field description cards prepared. 

If there were any reservoirs or rivers in the county, 
training samples were obtained for water. If there were no 
bodies of water in the flightline, the analyst obtained an 
additional cluster map which would include water bodies. 
Training samples for water were then selected from this area.' 

As a general rule at least 25 wheat samples and 25 other 
samples were chosen in Kansas. In Indiana, fields were much 
smaller and homogeneous samples- were difficult to find- due to 
the ‘large proportion of boundary pixels. In general, more than 
25 samples each of corn, soybeans, and other were chosen, but 
the samples were small compared to those for Kansas'; 

The number of samples used for training- the cl-dssif ier 
in Kansas and Indiana is shown in Tables- 6 and 7, respectively. 
The median number of fields used for training in Kansas was 
66 and the median number of pixels used was 2600. lit Indiana, 
the corresponding figures are 163 fields and 2750 pixels. 

4.5.2 Development of^ Training Statistics 

The training fields for each major cover type have been- 
selected, but the spectral characteristics of each ci-ass have 
not been ' calculated. Each major cover type must be divided 
into its spectral subclasses, each of which must be a uni- 
modal, distribution to satisfy the assumptions of the-’ maximum 
likelihood, Gaussian classifier and is characterized 1 by its 
mean vector and covarian'c"e‘ matrix. Confusion between' the 
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Table 6. Number of fields and pixels used for training and 
testing the classifier in Kansas. 


County 

Training 

Samples 

Test 

Samples 

No. 

Fields 

No. 

Pixels 

No. 

Fields 

No. 

.Pixels 

Northwest District 





Cheyenne 

47 

1587 



Graham 

59 

1225 



Norton 

30 

600 



Sherman 

76 

2609 

75 

2289 

West Central District 





Greeley 

82 

3090 

81 

2672 

Ness 

82 

2400 



Trego 

50 

2955 

51 

2345 

Wallace 

67 

4139 



Southwest District 





Finney 

127 

2917 



Ford 

119 

3320 

121 

2763 

Hamilton 

117 

7161 

96 

5785 

Haskell 

77 

2118 



Hodgeman 

82 

5105 

83 

4927 

Seward 

43 

1001 



Stapton 

98 

6337 

132 

2884 

North Central District 





Cloud 

77 

1174 



Osborne 

39 

1446 



Ottawa 

56 

3215 



Smith 

97 

2924 



Central District 





Barton 

55 

2928 



McPherson 

57 

2562 



Russell 

42 

1257 



Saline 

50 

1847 

41 

994 

South Central District 





Barber 

58 

1942 

25 

2147 

• Harvey 

69 

2202 



Pratt 

69 

2850 

71 

3433 

Stafford 

62 

2586 

31- 

2522 

Sumner 

49 

2244 



Southeast District 





Allen-Neosho 

126 

4225 

131 

4149 
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Table 7. Number of fields and pixels used for training 
the classifier in Indiana. 


County 

Training 

No. 

Fields 

Samples 

No. 

Pixels 

Northwest District 

Benton 

144 

3271' 

Lake 

163 

3424 

LaPorte 

167 

3976 

Newton 

145 

2684 

Pulaski-Starke 

192 

4475 

White 

224 

3002 

West Central District 

Fountain- Parke 

337 

4419 

Montgomery 

223 

3715 

Owen 

82 

1595 

Tippecanoe 

92 

1685 

Vigo 

120 

2543 

Warren 

63 

1269 

Central Distr-ict 

Decatur 

155 

2748 

Grant 

163 

1690 

Hamilton-Howard-Tipton 

284 

4145 

Johns on- Shelby 

174 

2825 

Madison 

158 

1888 

East Central District 

Fayette • 

110 

1868 

Jay 

166 

1862 

Randolph 

277 

3035 

W ayne 

203 

2617 


spectral subclasses of different cover types must be mini- 
mized to decrease the error in classification. The adequacy 
of the training statistics should be evaluated before carrying 
out large area classifications. 

In order to satisfy the first of these three requirements, 
the cluster function was again used to obtain subclasses for 
the major cover types of wheat and nonwheat in Kansas and 
corn, soybeans, and other in - Indiana. This time, instead of 
one large rectangular area, the field center samples of each 
of the major cover types were clustered separately' to find 
natural groupings or spectral classes within the cover types. 

Statistics were calculated to represent each spectral 
class and the transformed divergence between each pair of 
classes was calculated. The saturating transformed divergence, 
a number between- 0 and 2000, provides a measure of the distance 
between classes in multi-dimensional space. High values indi- 
cate class pairs which are more separable and which, if grouped, 
would yield a bimodal distribution. Class pairs with small 
divergence values are spectrally similar and may be confused 
with each other during classification. If classes. of different 
cover types were spectrally similar, the analyst inspected the 
fields involved by checking the location and type of field on 
both the cluster. map and the aerial photography. If an error 
in field identification or location had been made, the class 
in error -was deleted. If no error occurred, the confusion 
classes were left in the training statistics since deleting 
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one or both of them would have biased the classification 
results . 

Test field classification results, if available, or 
training field results were used to evaluate the adequacy of 
the training statistics before the county was classified in 
order to allow for additional training if required. For many 
counties in Kansas, there were enough sample fields available 
that both a training and a test set could be developed. A 
statistical test showed that the proportion estimates calcu- 
lated using training field performance matrices were not 
significantly different in accuracy from estimates calculated 
using test field performance matrices. In Indiana, where the 
field sizes were small compared to Kansas, the number of 
usable samples was much smaller-, and selecting test fields 
from the sample fields would have greatly reduced the size of 
the training set; 

4.5.3 Classification and Tabulation of County Results 

The final training statistics were used to classify a 
systematic random sample of the Landsat pixels within each 
county (Figure 16) . In a systematic random sample, the first 
sample is chosen randomly and the remainder are determined by 
a constant sampling interval. Systematic random sampling was 
convenient and has the advantages of high precision and excel- 
lent geographic stratification [ 9 ] . 

For about '60 counties in Kansas and a few in Indiana, 
every other line and column was classified, a one- fourth 
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Figure 16. Schematic of a systematic random sample 
' of Landsat pixels classified within a 
county boundary. 
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sample. However, every fourth line and column, a one- 
sixteenth sample, was used for the remainder of the counties. 
Tests showed that there was no significant difference in 
results obtained between these two sample sizes. 

When a county was classified with a training set at 
least partially trained with fields from that county, the 
classification is labelled "local". A "nonlocal" classifi- 
cation is one in which the training set does not contain any 
training fields from the county classified. The training set. 
used to perform a nonlocal classification came from a county 
in the same Landsat frame having similar- soils and land use. 
Figure 17 is a map of Kansas showing geographically the local 
and nonlocal classifications and the source of training data 
for nonlocal classifications. Similar information for the 
counties classified in Indiana is given in Figure 18. Tables 
A1 and A2 in t he a ppendix summarize the Landsat frame, date 
of data, and source of training statistics for all counties 
classified in Kansas and Indiana. 

The number of points of each major cover type and the 
total number of points in the county were tabulated. These 
points fall within an irregular polygon in the Landsat data 
which corresponds to the county boundaries. ^Using the 
coordinates of cities and large towns which had been obtained 
earlier, the number of points of each major cover type in the 
urban areas were tabulated and subtracted from the county 
totals. These adjusted totals form the base of the area and 

S3 








Ul 

4 ^ 



Figure 17, 


Local and nonlocal classifications in Kansas, 
from the source of training statistics to the 
shaded areas denote local recognition counties 


Arrows point 
area classified 




Figure 18. Local and nonlocal classifications in 
Indiana. Arrows point from the source 
of training statistics to the area 
classified; shaded areas denote local 
recognition counties. 
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proportion estimates for the county. 


4.6 Preparation of Area and Variance Estimates 


Following classification, crop area and proportion 
estimates were made. Estimates of the areal extent or propor- 
tion of a cover type were desired for county, crop reporting 
district, and state levels. The county was the smallest unit 
for which an estimate was wanted, so estimates of the cover 
types of interest were made for each county and then aggregated 
to the district and state levels. Steps in the area estimation 
procedure included: (1) calculation of the area- and proportion . 

estimates, (2) correction of' the estimates for classif ication 
bias, and (3) calculation of variance estimates. For counties 
in which Landsat classifications were, not performed, a regres- 
sion procedure utilizing historical data and current Landsat 
estimates was used. 

4.6.1 Area and Proportion Estimates 

The Landsat estimated proportion of the i— crop in the j th 
county was calculated using the equation 


£.. = iii 

ij n . 


where n^ is the number of pixels classified as crop i and 

n^ is the total number of pixels in the' sample. The esti- 

til 

mated hectares of crop i in the j — county can be calculated 


in two equivalent ways: 
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h- - = p - -h . 
ij 3 

A 

where p^. is defined as above and i.s the number of hectares 
in the county', or 

. A 

h- • = k n. . 

where n - - is as above and k is the area in hectares of a pixel 

13 

(approximately 0.45). 

Area and proportion estimates for the crop reporting 
districts and the entire state are aggregated from the county 
estimates. The area estimate of crop i for a CRD .is found 

A 

by £ h..j summing the area estimates from all the counties in 
X3 

the CRD. The proportion of crop i in a CRD is found by 

^ij where the summations are taken over all the counties in 
“£1TT 

the J CRD and h^ and lu are as defined above. Area and propor- 
tion estimates for entire states are found similarly. 

4.6.2 Correction_for Classification Bias 


Experience has shown that it is inevitable' that some 
pixels are incorrectly identified by the maximum likelihood 
classifier. The primary source of these errors is overlapping 
density functions for two or more classes. For example, some 
corn looks like soybeans and/or some soybeans are spectrally 
similar to corn. Classification errors of this type cause 
the resulting area estimates to be biased. However, if the 
error rates are known the area estimates can. be adjusted or 
unbiased after the classification has been performed. This 
technique was first used in the 1971 Corn Blight Watch 
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Experiment [18] and later in a Landsat-1 investigation by 
LARS [ 4 ] . 

An estimate of the classification error rates is the 

matrix of training or test field classification performance, 

/e 11 e 1 2 | 

E =^2 1 &22 J 

where e^. is the proportion of samples of type i classified 
as type j. If P is the vector of true proportions of the 

A 

cover types and P the proportions estimated from the Landsat 
data, then 

~ t 
P * E X P . 

A. 4 

Since P and E are known from the classification, but P, the 
vector of true proportions, is not known, 

p = (e 1 )" 1 ? 

is solved. The example of Figure 19 shows how this is done. 

It is pos sibl e for this method to give a negative value 

for the proportion of a cover type. Since it is .unrealistic 

for an estimate' of a proportion or probability to be negative, 

an alternative problem was considered when this occurred: 

min ( +\ i ^ 

°< Pi <l P - (e j p 

l 

for all p^, elements of the vector P. This is equivalent to 
minimizing the Euclidean distance (denoted by J j ’ | | ) between 
the true proportion and the Landsat corrected estimate. The 
vector of proportion estimates after bias correction is 
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(E T ) 


Figure 




-1 


1.2239 

-.2239 


-.2687 

1.2687 




1,2239 

-.2239 


-.2687 

•l.?687 




SRS ' 

Harvested 


Landsat 

Uncorrected 


Landsat 

Corrected 


31.6 


38.9 31.2 


19. A numerical example of classification bias 
correction (Cloud County, Kansas). 
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denoted by P. The discussion of bias correction generalizes 

to n cover types' of interest with E being an n x n matrix 

and the vectors having n components . 

The corrected estimate will be unbiased if the error 

matrix found from. the test or training field performance is 

the true error matrix. It may not be truly unbiased because of 

photointerpretation difficulties or because the flightline 

might not be representative of the entire area classified. 

4.6.5 Calculation of Variance Estimates 

In addition to knowing the accuracy of an estimate, it is 

desirable to know the precision, or variance, of the estimate. 

The variances of the proportion and area estimates were 

obtained as follows. Since each pixel is classified as crop 

i or not, the binomial distribution can be used to obtain the 

variance of the bias -corrected proportion estimates. For the 
til 

j — county, an estimate of the variance is given by 



where f. is the county sampling fraction [ 8 ] . For individual 
county estimates, the sampling fraction can be ignored (though 
it is not negligible] to give a conservative estimate of the 
variance. As 




p - . h. 


the variance of the area estimate h. . can be calculated by 
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v h, - = h. V p,. 


where h^ is the total number of hectares in the county. 

In calculating the proportion estimate from the sample 
the assumption is made that each pixel would be classified as 
a particular crop or not classified as that crop, which leads 
to a multinomial or binomial model of the classified 4 ata * 

The binomial distribution can be used to estimate the total 
number of wheat pixels and the percentage of wheat in the 
area. Theoretical estimates of the sampling error are then 
available [8]. It is also assumed that there is no cyclic 
pattern in the data to bias the estimate frpm a sample taken 
systematically. To test these assumptions, a sampling study 
was performed early in this project. 

The study examined the sampling error produced for a 
given sampling fraction against the theoretical error given 
by using binomial distribution theory. In order to measure 
just the effect of sampling, the error introduced in classi- 
fication was ignored by comparing the various samples to a 
100% sample. The results are based on classifications of Rice 
and Morton Counties, Kansas, and were substantiated by further 
tests in Benton and Wayne Counties, Indiana. 

In the Kansas sampling study, estimates of both the total 
number of wheat resolution elements and the percentage of 
wheat in the area were calculated for sampling fractions of 
50, 33.3, 25, 11.1, 10, 6.25, 4, and 2.8 percent. These 
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samples were taken systematically. For example, an 11 . 1 % sam- 
ple of- the area was obtained by tabulating the classification 
with both a line and column interval of three. Nine 11 . 1 % 
samples were selected with a different starting point for each 
sample. The theoretical variance of these sample estimates 
was calculated from the binomial distribution and compared to 
the variance among the repeated estimates of the same sample 
size. For example, the theoretical - variance of an 11 , 1 % sample 
was calculated and then compared to the variance of the nine 
sample estimates. 

The results of the study (Table 8) showed that in all 
cases the two variances were not significantly different, 
indicating that the theoretical estimate of the sampling error 
based on the binomial distribution can be used as the estimate 
of the variance of the proportion estimate. The Morton results 
show a cyclic effect- due to "six line scan" noise. In prac- 
tice, Landsat data with such a noise problem was avoided. 

Wayne and Benton Counties in Indiana were used to test the 
applicability of the Kansas results to Indiana. The results 
were consistent with those of Kansas. 

The variance for a crop reporting district can be obtained 
in two ways. The variance can be calculated as though a sys- 
tematic random sample were taken throughout the district or 
it can be calculated considering each county as a stratum. The 
estimated variance for crop i in the stratified case would be 
given by: 
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Table 8. Theoretical and computed sampling errors of wheat 
proportion estimates for different sample sizes 
in two counties in Kansas. 


% Sample 


Standard 

Theoretical 

Error (%) 

Computed 

Rice County 

50.0 


0.0902 

0.0361, 0.1126* 

33,3 


0.1277 

0.1018, 0.1597 

25.0 


0.1563 

0.0992 

11.1 


0.2555 

0.1824 

10,0 


0.2717 

. 0.1752, 0.1937 

6.25 


0.3509 

0.2812 

4.0 


0.4453 

0.2797 

2.8 


0.5358 

0.4890 

Morton County 

50.0 • 


0.0867 

0.1293, 0.9233 

33.3 


0,1226 

0.0430, 1.0067 

25,0 


0.1501 

0.7637 

11.1 


0.2455 

0.8799 

10.0 


0.2599 

0.3358, 0.6939 

6.25 


0.3372 

0.6948 

4.0 


.0. 4241 

0.3405 

2.8 


0.5152 

2.6950 


* 50.0%j 33 . 3 % and 10% systematic samples can be taken in two 
ways. For example, a 50% Sample can be either every other line 
or every other column. 



where the summation is taken over all counties in the crop 
reporting district [ 8 ] . 

In essence, it matters little what formula, is used, to 
calculate the variance estimates whether conservative or not, 
because the estimates are very small in either case. The 
distribution in Indiana is actually given by the multinomial, 
but the variances can be calculated by considering each crop 
separately with the binomial assumptions-. 

4.6.4 Estimation for Counties Without Landsat Data 

An alternative approach for crop estimation must be taken 
when adequate data for Landsat classification is not available 
for an area. One approach to this problem lies in formulating 
a regression equation from which a crop prediction can be made. 

Regression is valid as a predictor only for the popula- 
tion from which it is derived. This predictor will not be valid 
for a county which has historical crop acreage, or county size 
falling outside the range of values used in the derivation of 
the regression equation. For these counties, the 1974 USDA/SRS 
area estimates were used as the 1975 estimates. Revised 
estimates from Kansas and preliminary estimates, from Indiana 
were used. 

For Kansas, the regression model used to predict the area 
in hectares of wheat in a given county was : 
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y - So + 81 X 1 + 32 X 2 + 33 X 3 


where Xi is the 1974 USDA/SRS wheat acreage for the county, x 2 
is the 1973 USDA/SRS wheat acreage for the county, and x 3 is 
the total, number of acres in the county. The coefficients 
So, 81 > 82 > and 83 are estimated by using the available Landsat 
estimates as y values. A pseudo-Landsat estimate is made by 
applying these coefficients to the x values of the counties to 
be estimated. 

Only historical data could be used in the regression in 
order to simulate real-time estimation. It was felt that 
wheat data before 1973 should not be considered because major 
increases in the wheat acreage planted occurred beginning, in 
1973. The area of the county was also included as a factor 
which might contribute to the amount of wheat grown. 

For Indiana, similar regression models were used to 
predict the area in corn and soybeans. Again, the variables 
considered as predictors were the number of acres in the county 
and the USDA/SRS estimates of acres harvested in 1973 and 1974 
for corn or soybeans. The regression model used was: 


y.=8 +.8Xj+8x + 8 x 

0 P 1 li 2 2 i 3 3 

A 

where y^ denotes the area in hectares of crop i, x-^ is the 
1974 USDA/SRS estimate of acreage in crop i for the county, 
X 2 - is the 1973 USDA/SRS estimate of acreage in crop i in the 
county, and X 3 is the total number of acres in the county. 
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4.7 Evaluation of Results 


Once an adequate training set has been defined, it is 
not difficult to classify large geographic areas using 
computer analysis techniques. However, unless the accuracy 
of such computer classification results can be verified, 
little has been accomplished by simply classifying the data 
over various areas of interest. 

In this investigation two quantitative evaluation tech- 
niques were used to judge the accuracy of crop classifica- 
tions and area estimates. One evaluation involved statis- 
tical sampling of individual areas of known cover types 
(designated as test fields] , This offers an effective method 
of examining inclusive and exclusive classification errors 
for the various crops or cover types. Such techniques, 
however, must be used with caution, and must be carefully 
designed to provide statistical reliability of the results. 

In general, areas need to be selected in such a way that 
the number of resolution elements in the test areas for each 
cover type are approximately in proportion to the amount of 
that cover type present in the area. 

A second quantitative technique for evaluating classifi- 
cation accuracy is comparison of area estimates from the 
computer classification and area estimates obtained by some 
conventional method. Ideally, crop area measurements from 
large contiguous areas would be used for comparison. 
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Realistically, it is not possible to acquire a large amount 
of such data. Therefore estimates of the crop areas or 
proportions must be used. The USDA/SRS annually publishes 
estimates - of the acreage of major crops for counties, dis- 
tricts, and states. Estimates or measurements for a smaller 
unit such as a township are generally not available. 

In addition to evaluating the classification accuracy, 
several factors which might have affected accuracy were 
examined. 

4.7.1 Assessment of Training and Test Field Classification 
Accuracy 

Test fields are frequently used to evaluate the accuracy 
of the Landsat classifications. Areas with, a known cover 
type which were not used for training are chosen as test 
fields. These are then classified and the accuracy of the 
classifier determined by the proportions of - pixels which are (5 
correctly identified. If these fields have been randomly 
selected and their classification accuracy is high, then the 
classification of the entire area should be accurate. 

In this project test fields were chosen in a manner 
similar to training fields. Some of the fields identified 
from the aerial infrared photography were randomly selected 
as test fields. The method of random selection depended 
upon the analyst and included systematic sampling, strati- 
fied random sampling, and simple random sampling. However, 
in some counties all the available fields were used for 
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training, leaving none for test. In these cases, training 
field performance was evaluated to determine the accuracy 
of the classifier, since a statistical test of counties 
with both test and training fields showed that using training 
fields to evaluate classification accuracy was not signifi- 
cantly different from using test fields. 

4.7.2 Statistical Comparison of Landsat and ,U SPA/ SR'S 
Estimates , 

The standard of comparison for Landsat estimates was 
the USDA/SRS estimate of' acres harvested. SRS estimates were 
used primarily because of their availability on- a state, crop 
reporting district, and county basis for 1975. There is a. 
national agricultural census which also provides these esti- 
mates, but it is performed only every five years and was not 
taken in 1975. Acres harvested were used rather than acres 
seeded because • (1) the acquisition of Landsat data used in 
this analysis was closer to harvest time than to seeding time 
and (2) the harvested acreages are used for estimating total 
production. Estimates of both the proportion of total land 
area and of the area in hectares of a crop were considered 
as variables. 

The purpose of USDA/SRS crop surveys is, primarily, to 
make national estimates and, secondly, state' estimates . The 
state estimates are considered to be unbiased and to have 
small coefficients of variation, generally not exceeding about 
5% for major crops [23]'. The SRS does publish county and 
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crop reporting district estimates, but coefficients of varia- 
tion are not calculated for these estimates. It is expected 
that the county and CRD estimates will not be as accurate as 
the state and national estimates, and that the coefficients 
of variation will be larger at the county level. The SRS 
county estimates then are not the ideal standards for com- 
parison, but must be used due to lack of any more reliable 
data. 

The method used to arrive at county estimates varies 
from state to state. In Indiana, county estimates are made 
on the basis of mail surveys. About 12,000 questionnaires 
are mailed to get a response of at least 4,000. This should 
guarantee at least SO responses per county on which to base 
the estimates. The mail survey results are adjusted for the 
difference from the June enumerative survey CE* h. Park, State 
Statistician, Indiana, personal communication). Kansas, 
however, uses information from three different surveys .to 
calculate county estimates. The first is the annual State 
Farm Census which is supposed to be an enumeration of all 
farming operations in the state, but which contains some 
incompleteness. Mail surveys from June and late summer are 
combined with the census data to form a composite area esti- 
mate for each county. These are then adjusted for various 
factors and scaled to add to the state estimate (M. E. Johnson, 
State Statistician, Kansas, personal communication). 
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The levels for testing Landsat against SRS estimates were 
determined according to the problem at hand. In choosing a 
significance level, a large a is chosen to minimize the chance 
of claiming the hypothesis of equality is true when it is really 
false; a small value of a is chosen to minimize the chance of 
rejecting the hypothesis of equality when it is actually true. 

To ascertain whether SRS and Landsat estimates were close, 
the two estimates were obtained and the hypothesis of their 
equality, the null hypothesis, was tested. Statistical tests 
are not designed to prove that the null hypothesis is true, 
although in this case that is what we did want to conclude. 

In order to be reasonably certain that the SRS and Landsat 
estimates are the same, the probability of accepting the 
hypothesis of equality, when it was in fact false, was made 
very small. This x^as achieved by choosing a large value of a 
such as 0.25. 

4.7.3 Analysis of Factors Affecting Classification Accuracy 
In order to perform statistical tests on the Landsat 
estimates, normality and homogeneity of the data must be con- 
sidered. Standard tests for homogeneity were not useful here 

because they consider the variance of the sample variances, 

2 

which in this case was zero because the variance a is deter- 
mined rather than estimated by the large sample size used in 
Landsat estimation. Instead, the range was used to determine 
if the variances were homogeneous for tests on proportions. 
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Variances are stable only for proportion estimates in the 
0.30-0.70 range [1]. Since some values of the Landsat pro- 
portion estimates fell outside this range, a transformation "was 
required. For this range, p was transformed by arcsin /p [!']'. 

The nonhomogeneity of the data affects the statistical 
test results by introducing a bias into the test statistic, 
in this case either an F-statistic or a t-statistic. The bias 
of the F-statistic for the Kansas proportion variances was 
calculated and found to be 1.29 [6]. Thus, when testing a 
hypothesis with a significance level of a = 0.05, the hypothesis 
is really being tested with a = 0.09, and will be rejected too 
often. For this amount of bias, p should be transformed. 

The bias of the test statistic for Kansas area estimate 
variances was found to be 1.17. Thus when testing a hypothesis 
with a significance level of a = 0.05, the hypothesis, would 
really be tested with a = 0.07. This is not as biased as is 
the case with the proportion variances, though the null 
hypothesis would be rejected slightly too often.. Testing was 
performed on these variables without transformation. With 
larger sample s.izes, homogeneity tends to. be- a~ minimal :problem. 
For Indiana, the proportion estimates were transformed and the 
hectare estimates were- not, foll.owing, the same pat„te,rn as for 
Kansas . 

Numerous tests were made to identify and assess- factors 
which- might affect the accuracy of the area and proportion 
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estimates. Those factors tested included: date of the Landsat 
coverage, date of the aerial photography (Indiana only), effect 
of the data analyst (Kansas only) , the effect of local versus 
nonlocal recognition, and the effect of geographic location 
(crop reporting districts) . 

For Kansas,. two types of tests were made for testing the 
effect of date. The first was a paired comparison of 10 
counties which had been classified twice using two different 
Landsat frames. The second type of test, done in both Kansas 
and Indiana, used all counties which were classified and tested 
for a difference due to groups of dates. A limitation of this 
test is that date effects may be confounded with other factors 
such as geographic location. 

Tests for the effect of aerial photography date were not 
done in Kansas because essentially only one date was used. For 
Indiana', all counties were included in the analysis and tests 
were performed inT^the same manner and with the same limitations 
as the tests for the effect of date of Landsat data. 

In tests for the data analyst and local vs. nonlocal 
recognition effects, all available data were utilized. In tests 
to determine the accuracy of a CRD or state, duplicate observa- 
tions were not permitted. Of these duplicates, the estimate 
derived from the Landsat pass closest to harvest was used 
without reference to which one was closer to the SRS estimate. 
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5.0 WHEAT IDENTIFICATION AND AREA ESTIMATION IN KANSAS 


In this section the results of the Landsat data analysis 
for winter, wheat identification and area estimation in Kansas 
are presented and evaluated. The material includes a discus- 
sion of factors affecting classification accuracy, comparisons 
and evaluations -of training and test field classif ication 
performance, and comparisons of USDA/SRS estimates to Landsat- 
derived estimates of the area and proportion of wheat. 

Finally, the accuracy and precision of the Landsat estimates 
are discussed. 

5.1 Analysis of Factors Affecting Classification Accuracy 

Although an assessment of factors affecting classifica- 
tion performance was not a primary objective, several anal- 
yses to assess factors which might have influenced classifi- 
cation results were performed in order to more fully under- 
stand and interpret the results. The variables tested 
included: Landsat acquisition date, data analyst, local vs. 
nonlocal' classifications, and the interaction of date and 
locality. The results of these tests are presented in this 


section. 
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5.1.1 Effect of Landsat Acquisition Date 

Ten of the 13 counties in the South Central Crop Report- 
ing District were classified twice, using data from two dif- 
ferent Landsat passes. All counties were classified using 
April data and then reclassified using either May or June 
data (Table 9) . Since these were the only counties for which 
multitemporal data were available, they were used to explore 
the effect of dates on classification performance* The 
"goodness" of an estimate was considered to be its closeness 
to the SRS estimate. Paired t-tests showed that there was 
no significant difference (a = 0*25) in the accuracy due to 
the date of Landsat coverage. The inference of these tests 
is not strong due to the small sample size, so a further 
study on the effect of dates with larger samples WAS per- 
formed. 

A second analysis, including all counties in Xhe seven 
districts classified, was performed to determine if there 
was an effect due. to the date of the Landsat data acquisition, 
ignoring other f actors . Five groups of dates were considered: 
early April, early May, late May, mid- June, and early July. 

An analysis of variance showed that neither the proportion 
nor area estimates were significantly affected by Landsat 
data acquisition period. These results indicate that date was 
not a major factor influencing the classification performance 
and that all counties regardless of the date of Landsat data 
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■Table 9. Comparison of wheat estimates from April and May or June Landsat data 
acquisitions to USDA/SRS harvested estimates, South Central Crop 
Reporting District, Kansas. 


County 

Date 

USDA/SRS 

Harvested 

Hectares Proportion 

Landsat 

Classification 
Hectares^ Proportion 

Difference 
From SRS 

Hectares Proportion 



(000) 

C*) • 

(000) 

C%) 

(000) 

m 

Barber 

April 

69.1 

23.3 

23^1 

7.8 

-46.0 

-15.5 


May 

69.1 

23.3 

89.4 

30.1 

20.3 

6.8 

Comanche 

April 

43.4 

20.9 

31.1 

15.0 

-12.3 

- 5.9 


May 

43.4 

20.9 

46.3 

22.3 

3.0 

1.4 

Edwards 

April 

53.1 

33.4 

58.0 

36.4 

4.9 

3.1 


May 

53.1 

33.4 

46.6 

29.3 

- 6.5 

- 4.1 

Harper 

April 

116.3 

56.0 

110.8 

53.4 

- 5.5 

- 2.6 

4 

1 

June 

116.3 

56.0 

117.8 

56.8 

1.5 

0.7 

Harvey 

April 

55.0 

39.3 

55.3 

39.5 

0.3 

0.2 


June 

55.0 

39.3 

42.2 

30.2 

-12.8 

- 9.1 

Kingman 

April 

97.0 

43.3 

113.7 

50.8 

16.7 

7.5 


May 

97.0 

43.3 

124.8 

55.8 

27.9 

12.4 

Kiowa 

April 

51.3 

27.5 

43.3 

23.2 

- 8.0 

- 4.3 


May 

51.3 

27.5 

45.6 

24.4 

- 5.6 

- 3.0 

Pratt 

April 

82.6 

43.7 

91.3 

48.3 

8.8 

• 4.6 


May 

82.6 

43.7 

80.5 

42.6 

' - 2.0 

- 1.1 

Sedgwick 

April 

105.3 

40.7 

71.0 

27.5 

-34.3 

-13.3 


June 

105.3 

40.7 

117.3 

45.4 

12.0 

4 . 6 

Sumner 

April 

196.9 

64.3 

217.0 

70.9 

20.1 

6 . 6 


June 

196.9 

64.3 

195.8 

63.9 

- 1.1 

- 0.4 



acquisition can > ' considered together. The results also 
mean that a best date for Landsat coverage cannot be recom- , 
mended from this study. 

5.1.2 Effect of Data Analyst 

Since there was no significant date 'effect, the effect 
of analysts on the classification performance could be con- 
sidered. This was a nested design with counties appearing 
within analysts,. Three analyses were run: (1) all counties 
(2) all local counties, and (3) all nonlocal counties. Each 
result: showed that the analyst effect was nonsignificant at 
any reasonable a level when considering either proportion 
or area estimates. Since all analysts used similar methods, 
no inferences can be made about methodology; but it can be 
concluded that individual analysts did not introduce a bias 
in the results. 

5.1.3 Effect of— Local vs. Nonlocal Recognition ' 

One of the major problems encountered in the LACIE has 
been to develop a means for successfully extending training 
statistics from a training segment to "recognition" segments. 
In our investigation a different methodology involving strat- 
ification of counties into groups having similar character- 
istics and developing training statistics from throughout the 
training county was used. To- determine if this method was 
satisfactory for classifying several counties the effect of 
local vs. nonlocal classification was tested. For proportion 
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estimates, the difference became apparent at thp 20% signif- 
icance level. For area estimates, however, the difference 
was significant- for any a larger than 0.10. Our conclusion 
is that there was some difference in performance between 
local and nonlocal counties; the amount of wheat: was over- 
estimated in local counties and underestimated in nonlocal 
counties; but, on the average, nonlocal recognition counties 
were closer to SRS estimates than the local recognition 
counties. It can probably be concluded that thi,s factor did 
not have a strong influence on the overall' results. 

5.1.4 Effect of Interaction Between Dates and locality 
In the South Central Crop Reporting District, there 
appeared to be an interaction between date of the Landsat 
coverage and locality. Since the sample size was too small 
to. draw any inference, a plot was made to examine this effect 
for the entire state. The interaction that was present in 
the South Central district analysis was not present over the 
entire state, although other factors which may have affected 
the accuracy were ignored. There is no good test on the 
significance of this interaction since variance estimates 
from the SRS are not available. - 

5.2 Landsat Classification Results 

The Landsat classification results include the training 
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and test field performances; estimates of the area and pro- 
portion of wheat for the state, districts, and counties; 
comparisons of the Landsat estimates to USDA/SRS estimates; 
and evaluation of the accuracy and precision of the Landsat 
estimates. In addition regression estimates of wheat area 
and proportion in two districts for which Landsat data was 
not available are presented. 

5.2.1 Classification Accuracy 

Classification accuracy was determined by the test field 
or training field performance matrices. The training field 
classification performance for all local recognition counties 
is given in Table 10. The test field performance is given in 
Table 11 for those counties which had test fields.. The 
accuracy of the classification as assessed by training fields 
is not significantly different from that found by measuring 
test field performance. The overall classification perfor- 
mances are generally 85% or higher, an indication that the 
classification should result in accurate area estimates. 

Since the classification performance of test (or training) 
fields was used to correct for classification bias in the area 
estimates, a plot was made of the absolute value of the bias 
correction of the Landsat results and the overall classifi- 
cation accuracy to show the relation between them (Figure 20) . 
The simple correlation between these two variables, is 
r = -0.80. The amount the Landsat estimates were adjusted 
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Table 10. Classification accuracy of training fields 
in Kansas. 


CLASSIFICATION ACCURACY (%) 


COUNTY WHEAT OTHER OVERALL 


CHEYENNE 

87.8 

99.0 

91.8 

GRAHAM 

84.3 

87.2 

86.1 

NORTON 

93.7 

87.0 

89.5 

SHERMAN 

70.3 

97.5 

89.5 

CLOUD 

85.1 

81.9 

83.0 

OSBORNE 

95.4 

98.6 

97.4 

OTTAWA 

99.3 

99.5 

99.3 

SMITH 

88.3 

87.0 

87.2 

GREELEY 

82.7 

93.8 

90.0 

MESS 

95.7 

89.8 

91.3 

TREGO 

76.8 

77.1 

77.1 

WALLACE 

51.7 

97.7 

90.0 

BARTON 

95.3 

83.7 

87.8 

MCPHERSON 

99.5 

98.8 

99.1 

RUSSELL 

95.0 

92.2 

93.5 

SALINE 

72.3 

92.7 

82.5 

FINNEY 

97.0 

94.5 

95.4 

FORD 

94.9 

98.8 

97.4 

HAM 1 LTOM 

75.3 

55.5 

61.9 

HASKELL 

96.4 

98.8 

‘ 97.8 

HODGEMAN 

86.3 

79.3 • 

81.3 

SEWARD 

97.8 

98.2 

98.0 

STANTON 

66.8 

62.9 

63.6 

BARBER 

96.3 

99.7 

98.1 

HARVEY 

98.1 

93.7 

95.-5 

PRATT 

99.8 

94.8 

97.0 

STAFFORD 

94.4 

98.5 

96.4 

SUMNER 

93.4 

‘95.3 

94.3 

ALLEN 

94.2 

94.5 

94.4 
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Table 11. Glassification accuracy of test fields 
in Kansas. 


COUNTY 


CLASSIFICATION ACCURACY (!) 
WHEAT OTHER OVERALL 

SHERMAN 


75,4 

89.0 

85.0 

GREELEY 


84.8 

93.0 

89.9 

TREGO 


86.7 

81.1 

82.4 

SALINE 


83.5 

94.5 

87.5 

FORD 


93.7 

97.0 

95.7 

HAMILTON 


94.2 

78.4 

82.5 

HODGEMAN 


89.4 

77.7 

80.9 

STANTON 


62.5 

79.1 

75.5 

BARBER 


92.7 

88.8 

90.4 

HARVEY " 


93.6 

98,2 

95.6 

PRATT 


92.7 

95,6 

93.8 

STAFFORD 


99-. 5 

93.4 

96.0 

SUMMER 


92.6 

89.2 

91. '2 

ALLEN 


95,3 

89.7 

90.7 
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$Ri<SiNAL PAGE IB POOR 


80 


RESOLUTE SIRS PM 


18 - 



CLASSIFICATION 
ACCURACY (%J 


Figure 20. The relationship of the magnitude of the 
.calculated bias correction t’o' overall 
classification accuracy. 
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depends primarily upon the classification accuracy, but also 
on the estimated proportion of wheat in the county. The 
graph clearly shows that high classification performance is 
desirable to reduce the need for classification bias correc- 
tion. High classification performance for each individual 
cover type is also a desirable attribute. 

5.2.2 Classification Bias Correction 

To evaluate the consistency and usefulness of the bias 
correction, a subset of Kansas counties was examined. This 
was not a random sample of Kansas counties as the first 
completed counties were used, but it was considered to be 
representative enough and large enough to determine: (1) if 
the accuracy achieved by the estimates which used training 
field performance matrices to calculate' the bias is different 
from that achieved when test field performance matrices. are 
used, (2) if error matrices can be extended to nonlocal 
recognition counties, and (3) whether correction' for the bias 
increases the accuracy.- of the estimates by decreasing the 
difference from the SRS estimates. 

To determine if the accuracy achieved by the estimates for 
which training field performance matrices were used to calculate 
the bias is different from that achieved when test field 
performance matrices were used, the variable considered was 
the difference between Landsat and. SRS estimates. The test 
performed was a two-sample t-test for difference in the 
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means between those counties for which training fields were 
used and those counties for which test fields were used 
to calculate the biases. The results were' nonsignificant at 
the 25% significance level. It can be concluded that when 
test field performance is not available, the bias can be 
calculated by using the error performance matrix from the 
training fields. 

Nonlocal recognition counties present another problem 
because there is no reference data from which a classifica- 
tion performance matrix can be obtained. Since statistics, 
for the classification were extended from another county, it 
also seemed reasonable to extend the error matrix from the 
same county. To. determine the validity of this' extension, 
differences of Landsat estimates from SRS estimates for local 
counties were tested against the differences from SRS for 
nonlocal counties. This was accomplished by t-' tests and the 
results showed that there was no difference (a = 0.25) 
between the closeness of Landsat estimates to SRS for cor- 
rected local counties and for corrected nonlocal counties. 

It, therefore, seemed reasonable to calculate the bias cor- 
rection for nonlocal recognition counties by the extension 
of an error matrix. 

Two t-tests were used for quantitative evaluation of 
the bias correction. For local recognition counties, the 
corrected estimates for proportions and areas did not differ 
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from the SRS estimates at the 25% significance level. On 
the other hand,. the uncorrected estimates did differ from 
SRS estimates at the 25% level, indicating that correction 
for the bias brought Lands at estimates closer to the SRS 
hectares harvested. Hence, all the local recognition coun- 
ties were corrected for bias by the method previously 
described. 

For the nonlocal recognition counties, the bias correc- 
tion also brought the Landsat estimates closer to the SRS 
estimates. There was a significant difference (a = 0.001) 
from SRS in both proportion and area of wheat for the uncor- 
rected estimates while the corrected estimates were not 
significantly different from the SRS estimates even at 
a = 0.25. Therefore, all nonlocal county estimates were also 
corrected for classification bias. 

In summary, we concluded that correcting for the bias 
is worthwhile since the difference of the corrected Landsat 
estimates from the SRS estimates is nonsignificant. Cor- 
rection for the bias seems to be consistent between counties 
having test performance matrices and counties having only 
training performance matrices and is also consistent in 
extending error matrices to nonlocal counties. The same 
results were obtained for this part of the analysis regardless 
of whether the variable considered was proportion or area 
of wheat. 
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5.3 Wheat Area and Proportion Estimates 


The estimates of hectares and proportions from the 
Landsat classifications on a county -by “County basis are pre- 
sented in Table 12. Estimates for both proportion and area 
of wheat are given as the uncorrected and bias-corrected 
values. The values used in. the statistical analysis w.ere 
always the bias-corrected estimates. 

5*3.1 Correlation of Landsat and USDA/SRS Estimates of Ar ea 
and Proportion of Winter Wheat ~ ' 

The SRS estimates for proportion and area of wheat 
harvested are presented in Table 13 along with the corres- 
ponding Landsat estimates and their differences. The pro- 
portion and area estimates obtained from the Landsat classi- 
fication are highly correlated with the USDA/SRS estimates.. 
The correlation between Landsat and SRS wheat harvested pro- 
portions is r = 0.77 + 0.05 (Figure 21), while' the correla- 
tion between Landsat and SRS wheat area estimates is 
r = 0.80 + 0.04 .for harvested estimates (Figure 22). The 

correlation values are presented in standard error form which 
represents approximately a 68% confidence interval. These 
intervals are not exactly symmetric, but the. furthest bound- 
ary has been presented here for simplicity [11] . 

5-3.2 Accuracy of Landsat Es timates 

The accuracy of Landsat estimates of the area and pro- 
portion of wheat can be assessed at three levels: state. 
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Table 12. .Uncorrected and bias-corrected Landsat estimates 
of -hectares and proportions of wheat in Kansas. 


LANDSAT LANDSAT 

UNCORRECTED CORRECTED j 

ESTIMATES ESTIMATES 


COUNTY 

HECTARES 

PROPORTION 

HECTARES 

PROPORT 


(000) 

<%> 

(000) 

(*> 

NORTHWEST DISTRICT 





CHEYENNE 

03.5 

35. 1 

82.6 

31.0 

OECATUR 

55.7 

23.9 

31.4 

13.5 

GRAHAM 

50.6 

25.8 

44.8 

19.4 

NORTON 

70. 1 

30.8 

50.3 

22.1 

RAWLINS 

69.0 

24.7 

76.2 

27.3 

SHERIDAN 

79.7 

34.5 

53. 1 

23.0 

SHERMAN 

46.8 

17.1 

25.8’ 

9.4 

THOMAS 

4 5.6 

16.5 

• 22.6 

8.2 

TOTAL 

520.0 

25.8 

386.8 

19.2 

NORTH CENTRAL DISTRICT 





CLAY 

37.5 

22.3 

36.5 

21.7 

CLOUD 

71.7 

38.9 

57.5 

31.2 

JEWELL 

44.8 

19.1 

19.0 

8.1 

MI.TCHELL 

83.4 

44.9 

86.7 

46.7 

OSBORNE 

78.2 

33.6 

80.7 

34.7 

OTTAWA 

54.3 

29.0 

53.5 

28.6 

PHILLIPS 

44.9 

19.3 

17.9 

7.7 

REPUBLIC 

63.8 

36.9 

52.6 

28.2 

ROOKS 

81.4 

35.4 

72.2 

31.4 

SMITH 

53. 1 

22.9 

56.3 

24.3, 

WASHINGTON 

70.1 

, 30.4 

42. 1 

18.3 

TOTAL 

680.2 

29.9 

575.0 

25.0 


WEST CENTRAL DISTRICT 


GOVE 

75.0 

GREELEY 

83. 8 

LANE 

76.5 

LOGAN 

45.1 

NESS 

89.7 

SCOTT 

60.2 

TREGO 

85.5 

WALLACE 

36.3 

WICHITA 

58.6 

TOTAL 

610.7 

CENTRAL DISTRICT 


BARTON 

120.6 

DICKINSON 

84.9 

ELLIS 

117.3 

ELLSWORTH 

61.3 

L INCOLN 

62.5 

MCPHERSON 

104.2 

MARION 

69.5 

RICE 

105.3 

RUSH 

126. 1 

RUSSELL 

67.6 

SALINE 

75.6 

TOTAL 

994.9 


27.0 

33.1 

11:9 

41.3 

89.5 

4^.1 

41.0 

60.9 

32.6 

16.2 

78.5 

28.2 

32.0 

71.2 

25.4 

32.1 

65.4 

34.9 

36.6 

60.3 

25.8 

15.4 

61.3 

26.0 

31.2 

58.4 

31-1 

29.5 

578.6 

28.0 


53.8 

107.4 

47.9 

38.3 

91.5 

41.3 

50.3 

108.2 

46.4 

32.9 

53.3 

28.6 

33. 2 - 

54.5 

28.9 

44.9 

103.9 

44.8 

28.0 

68.5 

27.6 

56.4 

95.2 

51.0 

67.2 

134.2 

71.5 

29.5 

56.8 

24.8 

40.5 

82.9 

44.4 

42.8 

956.4 

41.2 
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Table 12. (continued} 


COUNTY 

LANDSAT 

UNCORRECTED 

ESTIMATES 

. HECTARES PROPORTION 

LANDSAT - 
CORRECTED 
ESTIMATES 

HECTARES PROPORTION 


I OOO I 

(%) 

1 000 I 

(%> 

SOUTHWEST DISTRICT 





CLARK 

30.5 

12.0 

25.9 

10.2 

FINNEY 

148.5 

44.0 

143.1 

42.4 

FORD 

73.4 

26.1 

71.7 

25.5 

GRANT 

39.0 

26.5 

9.8 . 

6.6 

GRAY 

59.4 

26.4 

60. L 

26.7 

HAMILTON 

138.5 

53.9 

114.3 

44.5 

HASKELL 

30.9 

20.6 

30.9 

20.6 

HODGEMAN 

114.5 

51.4 

96.7 

43.4 

KEARNEY 

43.5 

22.0 

0.8 

0.4 

me-ade 

20.7 

8.2 

14.4 

5.7 

MORTON 

55.2 

29.4 

37.9 

20.2 

SEWARD 

36.2 

21.9 

34.2 

20.7 

STANTON- 

63.8 

36.4 

47.3 

27.0 

STEVENS 

61.6 

32.6 

28.3 

15.0 

TOTAL 

920.7 

30.0 

715.4 

23.3- 

SOUTH CENTRAL DISTRICT 





BARRER 

88.5 

29.8 

8 9.4 

30.1 

COMANCHE 

44.0 

21.2 

46.3 

22.3 

EDWARDS 

44.4 

27.9 

46.6 

29.3 

HARPER 

114.3 

55.1 

11 7. 8 

56.8 

HARVEY 

47.7 

34.1 

42.2 

30.2 

KINGMAN 

118.8 

53.1 

124.8 

55.8 

KIOWA 

43.4 

23.2 

45.6 

24.4 

PAWNEE 

77.3 

39.8 

68. 7 

35.4 

PRATT 

76.8 . 

. 40.6 

80. 5 

42. 6 

RENO 

123.3 

37.9 

108.3 

33.3 

SEDGWICK 

116.6 

45.1 

117.3 

45.4 

STAFFORD 

83.9 

40.3 

75.0 

36. 5 

SUMNER 

187.8 

61.3 

195.8 

6-3.9 

TOTAL 

1166.8 

40.2 

1158.3 

40.0 

SOUTHEAST DISTRICT 





ALLEN 

25.9 . 

19.0 

14.9 

1 L-4 

BOURRON 

25.5- 

15.4 

10.2 

6.2 

BUTLER 

38.6 

10.3 

15.8 

4.2 

•CHAUTAUQUA 

23.5 

14.1 

0.0 

0.0 

CHEROKEE 

34.3 

22.5 

22. 1 

14.5 

COWLEY • 

53.3 

18.1 

43.0 

14.6 

CRAWFORD 

24.9 

16.1 

- 10.8 

7.0' . 

ELK 

27.9 

16.7 

0.0 

0.0 

GREENWOOD 

59.8 

20. 1 

0.0 

0.0 

LABETTE 

.34. 5 

20.3 

20.4 

12.0 

MONTGOMERY 

57.2 

34.0 

23.2 

13.8 

NEOSHO 

24.2 

15.9 

10.4 

6 * 8 

WILSON 

57.6 

38.7 

33.5 

22.5 

WOODSON 

55. 7 

42.7 

38.1 

.29.2 

TOTAL 

542.9 

20.3 

242.4 

9.1 

STATE TOTAL 

5444.2 

31.4 

4612.9 

26.6 
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Table 13. Comparison of USDA/SRS wheat harvested estimates 
and' bias-corrected Landsat estimates of area and 
proportion of wheat in Kansas. 


USDA/SRS LANDSAT DIFFERENCE 

HARVESTED CLASSIFICATION FROM SRS 


COUNTY 

HECTARES 

PROPORTION 

HECTARES 

PROPORTION 

HECTARES 

PROPORTION 


(000) 

(95) 

.(000) 

(%> 

(000) 

(%> 

NORTHWEST DISTRICT 




. 



CHEYENNE 

61.0 

22.9 

82.6 

31.0 

21. .7 

a.i 

DECATUR 

48. 6 

20.9 

31.4 

13.5 

-17.2 

-7.4 

graham 

44.2 

19.1 

44.8 

19.4 

0.6 

0.3 

NORTON 

42.3 

18.5 . 

50.3 

. 22.1 

8.0 

3.5 

RAWLINS 

60.3 

21.6 

76.2 

27.3 

15.9 

5.7 

SHERIDAN 

50.2 

21.7 

53.1 

23.0 

3-.0 

1.3 

SHERMAN 

73.1 

26.7 

25.8 

9.4 

-47.4 

-17,3 

THOMAS 

90.4 

32.6 

2216 

8.2 

-67.8 

-24.5 

TOTAL 

470.1 

23.3 

386.8 

19.2 

-83.3 

.-4.1 

NORTH CENTRAL DISTRICT 






CLAY 

45.0 

26.8 

36.5 

21.7 

-8.5 

-5.1 

CLOUD 

58.1 

31.6 

57.5 

31.2 

-0.6 

-0.3 

JEWELL 

56.4 

24.0 

19.0 

. -8.1 

-37.4 

-15.9 

MITCHELL 

71.2 

38.4 

86.7 

46.7 

15.6 

8.4 

OSBORNE 

57.9 

24.9 

80.7 

34.7 

22.8 

9.8 

OTTAWA 

66.3 

35.4 

53.5 

28.6 

-12.7 

-6.8 

PHILLIPS 

35.8 

15.4 

• 17.9 

7.7 

-18.0 

-7.7 

REPUBLIC 

47.1 

25.3 

52.6 

28.2 

5.5 

3.0 

ROOKS 

53.6 

23.3 

72.2 

31.4 

10.6 

8.1 

SMITH 

45.6 

19.7 

56.3 

24,3 

10.7 

4.6 

WASHINGTON 

41.0 

17.8 

42.1 

181.3 

1.1 

0.5 

TOTAL 

578.0 

25.1 

575.0 

25.0 

-3.0 

-0.1 

WEST CENTRAL DISTRICT 






GOVE 

56.5 

, 20.4 

33.1 

11.9 

-23.4 

-8.4 

GREELEY 

72.2 

35.6 

89.5 

44.1 

17.3 

8.5 

LANE 

55.1 

29.5 

60.9 

. 32.6 

5.8 

3.1 

LOGAN ‘ 

64.0 

23.0 

78.5 

28.2 

14.5 

5.2 

NESS 

74.7 

26.7 

71.2 

25.4 

-3.5 

-1.2 

SCOTT 

58.2 

31.1 

65.4 

34.9 

7.2 

3.9 

TREGO 

49.8 

21.3 

60.3 

25.8 

10.5 

4.5 

WALLACE 

35.0 

14.8 

61.3 

26.0 

26.3 

11.1 

WICHITA 

56.1 

29.9 

58.4 

31.1 

2.4 

1.3 

TOTAL 

521.6 

25.2 

578.6 

28.0 

57.0 

2.8 

CENTRAL DISTRICT 







BARTON 

95.7 

42.7 

107.4 

47.9 

11.6 

5.2 

DICKINSON 

72.3 

32.6 

91.5 

41.3 

19.3 

8.7 

ELLIS 

54.8 

23.5 

108.2 

46.4 

53.5 

22.9' 

ELLSWORTH 

52.3 

28.1 

53.3 

28.6 

1.0 

0.6 

LINCOLN 

53.8 

28.6 

54.5 

■ 28.9 

0.6 

0.3 

MCPHERSON 

99.6 

43.0 

103.9 

44.8 

4.3 

1.9 

MARION 

65.1 

26.2 

68.5 

27.6 

3.4 

1.4 

RICE 

78.5 

42.0 

95.2 

51.0 

- 16.8 

9.0 

RUSH 

74.9 

39.9 

134.2 

71.5 

59.3 

31.6 

RUSSELL 

56.7 

24.8 

56.8 

h 24 . 8 

0.1 

0.0 

SALINE 

66.0 

35.4 

82.9 

44.4 

16.9 

9.0 

TOTAL 

769.7 

33. 1 

956.4 

41.2 

186.7 

8.1 
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Table 13. (continued) 



USDA/SRS 

LANDSAT 

difference 


HARVESTED 

CLASSIFICATION 

FROM 

SRS . 

COUNTY 

HECTARES 

PROPORTION 

HECTARES 

PROPORTION 

'HECTARES 

PROPORTION 


(000) 

<%> 

(000) 

(%> 

(000) 

(%) 

SOUTHWEST D l STRICT 




' 


Cl ARK 

44.4 

17.4 

25.9 

10.2 

- 18.5 

— T « 3 

FINNEY 

94.2 

27.9 

143 . 1 

42.4 

48.9 

14.5 

FORD 

95.6 

34.1 

71.7 

25.5 

-23.9 

”■8.5 

GRANT 

36.2 

24.6 

9.8 

6.6 

— 26.4 

-18 .0 

GRAY 

70.1 

31.1 

60.1 

26.7 

— 10*0 

—4 . 5 

HAMILTON 

62.7 

24.4 

114.3 

44.5 

51 . 6 

20. 1 

HASKELL 

46.1 

30.7 

30.9 

20.6 

-15.2 

— 10. 1 

HODGEMAN 

55.5 

24.9 

96.7 

43.4 

41.2 

18.5 

KEARNEY 

53.6 

24.3 

0.8 

0.4 

-52.9 

-23.9 

MEADE 

62.9 

24.9 

14.4 

5.7 

— 48 ^ 6 

-19. 2 

MORTON 

36.3 

19.3 

37.9 

20.2 

1.6 

0*8 

SEWARD 

38.3 

23.1 

34.2 

20.7 

—4 • i 

-2 . 5 

STANTON 

49.9 

28.5 

47.3 

27.0 

~2 m 6 

-1.5 

STEVENS 

38.1 

20.2 

28.3 

15.0 

-9.8 

— 5*2 

TOTAL 

703.9 

25.6 

715.4 

23.3 

-68.5 

-2.3 

SOUTH CENTRAL 

DISTRICT 






GARBER 

69.1 

23.3 

89.4 

30. 1 

20*3 

6.8 

COMANCHE 

. 43.4 

20.9 

46.3 

22.3 

3.0 

1.4 

EDWARDS 

53.1 

33.4 

46 . 6 

29.3 

-6.5 , 

—4- 1 

HARPER 

116.3 

56.0 

117.8 

56.8- 

1.5 

0*7 

HARVEY 

55.0 

39.3 

42.2 

30.2 

-12.8 

—9 » 1 

K iNGMAN 

.97 .0 

43.3 

124.8 

55.8 

27.9 

12.4 

KIOWA 

51.3 

27.5 

45.6 

24.4 

-5.6 

-3*0 

PAWNEE 

71.5 

36.9 

68.7 

35.4 

-2.8 

-1*4 

PRATT 

82.6. 

4 3. 7 

80.5 

42.6 

2 • 0 

-i.i 

RENO 

146.4 

45.0 

108.3 

33.3 

-38.0 

-11.7 

SEDGWICK 

105.3 

40.7 

117.3 

45 . 4 

12.0 

4.6 

STAFFORD 

76.6 

37.3 

75.0 

*36, . 6 

-1.6 

— 0.8 

SUMNER 

196.9 

64.3 

195.8 

63.9 

-1.1 

— 0 * 4 

TOTAL 

1164.5 

40.2 

1158.3 

o 

* 

o 

-6.2 

-0.2 

SOUTHEAST OiSTRICT 






ALLEN 

11.4 

8.7 

14.9 

11.4 

3.5 

2.7 

BOURBON 

— 7.5 

4.5 

10.2 

6.2 

2.7 

1.6 

BUTLER 

• 42.3 

11.3 

15.8 

4 *2 

— 26.6 

—7 • 1 

CHAUTAUQUA 

8.9 

5.3 

0.0 

0.0 

-8.9 

-5.3 

CHEROKEE 

18.9 

12.5 

22.1 

14.5 

3.1 

2.1 

COWLEY 

64.3 

21.8 

43.0 

14.6 

-21.3 

— 7.2 

CRAWFORU 

10.9 

7.0 

10.8 

7.0 

-0.0 

0.0 

ELK 

8.9 

5-3 

0.0 

0.0 

-8.9 

— 5.3 

GREENWOOD 

6.9 

2.3 

0.0 

• 0.0' 

—6 • 9 

~2 * 3 

LABETTE 

20.8 

12.3 

20.4 

12.0 . 

-0.4 

-0.2 

MONTGOMERY 

23.0 

13.7 

23.2 

13.8 

0.2 

0. L 

NEOSHO 

14.1 

9.3 

10.4 

6.8 

—3 • 7 

“2 * 4 

WILSON 

21.5 

14.5 

33.5 

22.5 

12.0 

8.0 

WOODSON 

7.7 

5.9 

38. 1 

29.2 

30.3 

23.2 

TOTAL 

267.1 

10.0 

242.4 

9. 1 

-24.7 

0.9' 

STATE TOTAL 

. 4554.9 

26.2 

4612.9 

26.6 

58.0 

0.4 
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Figure 21. The correlation of Landsat and USDA/SRS 
estimates of the proportion of winter 
wheat in Kansas counties. 
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Figure 22. The correlation of Landsat and USDA/SRS 
estimates of the area of winter wheat 
in Kansas counties. 
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district, and county. A summary of the results at these 
three levels, including comparisons with the corresponding 
SRS estimates, is shown in Table 14. It should be noted 
that in comparing Landsat to SRS figures that the SRS fig- 
ures are also estimates (and, thus subject to sampling error). 
The accuracy of the SRS estimates is greatest at the state 
level and least at the county level. 

In tests of the accuracy of Landsat estimates at the 
state level,- a large a was used to reduce the possibility of 
claiming that Landsat estimates were the same as SRS esti- 
mates when, in fact, they were not. T- tests were performed 
to determine if there was a significant difference between 
Landsat and SRS estimates. At the 25% significance level, 
there was no difference in the proportion or area of wheat. 

At the crop reporting district level there was no 
significant difference in Landsat and SRS , estimates of pro- 
portion or area of wheat except in the Central CRD. In the 
Central CRD, wheat was overestimated for every county in 
relation to the SRS estimates, creating a bias' in the CRD 
estimate. However, all the county estimates were close to 
the SRS estimates except for two counties which, accounted 
for most of the difference. The Central CRD is not the 
"worst” CRD when considering relative difference or average 
absolute difference from SRS as a measure of comparison 
between crop reporting districts (Table 15). On the whole. 
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Table 14. Summary of USDA/SRS and Landsat estimates of 
area and proportion of wheat in Kansas. 




Area 


Proportion 

Region 

USDA/SRS 

Landsat 

Difference 

USDA/SRS Landsat Difference 


(000. Hectares) (I) 


State 

■ 4555 

4613 

58 

26. '2 

26.6 

0.4 

District 







Northwest 

470 

387 

- 83 

'23.3 

19.2 

-4.1 

North Central 

578 

575. 

- 3 

25.1 

25.0 

-0.1 

West Central 

522 

579 

57 

25/2 

28.0 

2.8 

Central 

770 

956 

187 

33.1 

41.2 

8.1 

Southwest 

784 

715 

- 68 

25.6 

23.3 

-2.3 

South Central- 

1164 

1158 

6 

40.2 

'' 40.0 

-0.2 

Southeast 

267 . 

242 

- 25 

1 10.0 

9.1 

• -0.9 

Counties 







(Median) 

55.0 

53.4 

0.6 

24.85 

26.25 

0..4 



Table 15. Relative difference and average absolute 

difference between Landsat and SRS estimates 


for 

districts 

and state. 



District 

Landsat 

Estimate 

• Difference 
from SRS 

Relative 

Difference 

'..Average 

Absolute 

Difference 


(000 Ha) 

(000. Ha) 

(%) 

(000 Ha) 

Northwest 

386.8 

- 83.3 

-21/5 

22.7 

North Central 

575.0 

- 3.0 

- 0.5 

13.8 

West Central 

578.6 

57.0 

9,9 

12.3 

Central 

956. *4 

186.7 

19/5 

17.0 

Southwest 

715.4 

- 68. .5 

- 9.6 

25.4 

South Central 

1158.3 

- 6.2 

- 0-.5 

10.4 

Southeast 

242.4 

- 24.7 

-10-. 2 

' -'9.2 

State 

4612.9 

58.0 

1.3 
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Landsat estimates were fairly close to SRS proportion and 
area estimates on a crop reporting district basis. 

No statistical tests could be performed for differences 
from SRS estimates on a county -by- county basis because SRS 
does not calculate county variance estimates. Similarly, 
confidence limits cannot be placed around tbe SRS estimates. 
However, if the standard deviation of the SRS proportion 
estimates is assumed to be at least 10% at the county level, 
then 89% of the Landsat estimates were within a 90% confi- 
dence interval. For further comparison of Landsat and SRS 
county estimates, 49% of the counties were within +5% 
(absolute difference) of SRS, 81% were within ^10%, and 88% 
were within jK15%. 

5.3.3 Precision of Landsat Estimates 

The second measure of the quality of an estimate is 
its precision- which refers to the size of the deviations from 
its expected value obtained by repeated application of the 
sampling procedure. Using statistical theory, however, it is 
not necessary to repeatedly sample the population to deter- 
mine the variance of an estimate. 

The Landsat estimates are of a binomial nature since 
each point was classified as wheat or other. The variance of 
p for a single county was calculated as: 
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where p is the proportion estimate after correction for the 
bias, n is the number of pixels classified in the county, 
and f=^ -where N is the- total number of pixels in the county. 
The standard deviations for' the -districts and State were - 
calculated considering the sample as 'stratified, but were 
approximately the same size as when- calculated under the 
assumption of a systematic random sample throughout the CRD 
or state. 

The standard deviations * and coefficients of variation 
of the Landsat estimates are shown in Table 16: It can 

readily be seen that the standard deviations and the coef- 
ficients of variation (CV) are extremely small even at the • 
county level., The CV of the SRS estimate of wheat acreage 
in the state of Kansas ’is 4%,, compared to the CV of 0706% 
for the Landsat estimate. The median CV' of the- La&dsrat 
county estimates -is 0,.60% which is smaller even than the 1.5% 
CV of the SRS national estimate of wheat acreage. Clearly 
the combined technologies of Landsat MSS data and computer- 
aided classification methods provides a means- to mahe- very 
precise crop area estimates. 
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Table 16. Estimates of the' standard deviations and coefficients 
of variation of Landsat estimates of wheat in Kansas. 


Proportion 

Area* Estimate Estimate 

Coefficient 

Standard Standard r of 


C o.unty. 

. Hectares 

Deviation 

% Deviation , 

Variation 


.(000 Ha) 

(Ha) 



(%) 

Northwest District 





Cheyenne 

82.6 

280.02 

31.0 

.1052 

.33 

Decatur 

31.4 

432.59 . 

13.5 

.1857 

1.38 

Graham 

4-4.8 

519.21 

19.4 

.2249 

1.16 

Norton . 

50.3 

527.01 

22.1 

. 2311 

1.05 

Rawlins 

76.2 

611.92 

27.3 

.2191 

.80 

Sheridan 

53.1 

235.82 

23.0 

.‘1019 

.44 

Sherman 

25.8 

184.11 

- 9.4 

. 0674 

.72 

Thomas 

22.6 

375.80 

8.2 

.1356 

1.65 

• Total 

386.8 

1191.33 

19.2 

.0590 

.31 

North Central District 





Clay 

' -36.5 

448.79 

21.7 

. 2668 

1.23 

Cloud 

‘ 57.5 

566.41 

31.2 

.3074 

.99 

Jewell 

19.0 

359.92 

8.1 

.1532 

1.89 

Mitchell 

- 86.7 

567.23 

46.7 

. 3058V 

.65 

Osborne 

- 80.7 . 

604.48 

34.7 

.2598 

. 75 

Ottawa 

53.5 

233.98 

28.6 

.1249 ' 

.44 

Phillips 

'17.9 

354.56 

- 7.7 

.1523 

1.98 

Republic 

52.6 

517.03 

28,2 

. 2775 

.98 

Rooks 

72.2 - 

689.56 

31.4 

. 2997 

.95 

Smith 

56.3 

561.17 

24.3 

. 2425 

1.00 

Washington 

42.1 

621.13 

18.3 

.2691 

1.47 

Total 

575.0 

1721.33 

25.0 

.0747 . 

.30 

West Central District 



* 


Gove 

33.1 

199.98 

11.9 

‘ .0714 

.60 

Greeley 

89.5 

265.57 

44.1 

.1309 

.30 

Lane 

60.9 

289.98 

32.6 

.1555 

.48 

Logan 

78.5 

278.-04 

28.2 

.1000 

.35 

Ness 

71.2 

271.56 

25.4 

..0969 

‘ .38 

Scott 

65.4 

243.08 

34.9 

- .1297 

.37 

Trego 

60.3 

' 249.10 

25.8 

.1067 

.41 

Wallace 

61.3 

249.47 

26.0 

.1057 

.41 

Wichita 

58.4 

236.34 

31.1 

.1260 

.41 

Total 

578.6 

763.55 

28.0 

. 0369 

.13 
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Table 16. (continued) 


County 


Area Estimate 
* ' ■ 

Standard 

Hectares Deviation 


Proportion 

Estimate 

— ~7"; Coefficient 

Standard of 

% Deviation Variation 


* 

(000 Ha) 

(Ha) 


C%) 

C%) ■ 

tral District 






Barton 

107.4 

269.37 

47.9 • 

.1202 

.25 

Dickinson 

91.5 

274^76 

41.3 

.1240 

.30 

Ellis 

108.2 

284.36 

46.4 

.1219 

.26 

Ellsworth 

53.3 

503.91 

28.6 

.2708 

.95 

Lincoln 

54.5 

522.31 

28.9 

.2777 

.96 

McPherson 

103.9 

283.67 

44.8 

.1223 

.27 

Marion 

68.5 

263.38 • 

27.6 

.1060 

.38 

Rice 

95.2 

562.69 

51.0 

.3012 

.59 

Rush 

134.2 

232.65 

71.5 

.1240 . 

.17 

Russell 

56.8 

537.75 

24.8 

.2351 

.95 

Saline 

82.9 

256.30 

44.4 . 

.1374 

.31 

Total 

956.4 

1277.74 

41.2 

.0550 

.13 

ithwest District 





Clark 

25.9 

182.06 

10.2 

.0714 

-.70 

Finney 

143.1 

783.49 

42.4 

.2323 

.55 

Ford 

71.7 

269.07 

25.5 

.0959 

.38 

Grant 

9.8 

110.96 

6.6 

.0754 

1.14 

Gray 

60.1 

552.52 

26.7 

.2454 

.92 

Hamilton 

114.3 

308.61 

44.5 

.1200 

.27 

Haskell 

30.9 

412.53 

20.6 

.2750 

1.33 

Hodgeman 

9 6. .7 

275.23 

43.4 

.1235 

.28 

Kearney 

0.8 

43.31 

0.4 

.0196 

4.90 

Meade 

14.4 

306.19 

5. .7 

■ .1210 

2.12 

Morton 

37.9 

205.85 

20.2 

.1096 

.54 

Seward 

34.2 

433.69 

20.7 

.2619 

1.27 

Stanton 

47.3 

217.81 

27.0 

.1244 

.46 

Stevens 

28.-3 

182.13 

15.0 

.0964 

.64 

Total 

715.4 

1336.91 

23.3 

.0436 

.19 
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Table 16. (continued) 


Proportion 

Area Estimate Estimate 


County 

Hectares 

Standard 

Deviation 

Standard 
% Deviation 

Coefficient 

of- 

Variation 


(000 Ha) 

(Ha) 


(%) 

(%) 

South Central 

District 





Barber 

•89.4 

291.83 

30.1 

.0983 - 

.33 

Comanche 

.46.-3 

219.97 

22.3 

.1061 

.48 

Edwards 

46.6 

213.44 

29.3 

.1341 

.46 

Harper 

117.8 

265.85 

56.8 

.1281 

.23 

Harvey 

42.2 

209.98 

30.2 

.1501 

.5.0 

Kingman 

124.8 

278.11 

55.8 

.1243 

.22 

Kiowa 

, 45.6 

216.33 

24.4 

.1160 

.48 

Pawnee 

68.7 

244.64 

35.4 

.1261 

.36 

Pratt 

80.5 

252.87 

42.6 

.1339 

.31 

Reno 

‘ 108.3 

312.23 

33.3 

.0960 

• .29 

Sedgwick 

117.3 

297.32 

45.4 

.1150 

.25 

Stafford 

75.0 

295.20 

36.5 

.1435 

.39 

Sumner 

195.8 

311.55 

63.9 

.1018 

.16 

Total 

1158.3 

954.06 

40.0 

. 0329 

..08 

Southeast District 





Allen 

14.9 

138.02 

11.4 

.1055 

.93 

Bourbon 

10.2 

113.60 

' 6.2 

.0686 

1.11 

Butler 

15.8 

147.35 

4.2 

. 0394 

.94- 

Chautauqua 

0.0 

0.00 

0.0 

. 0000 

.00 

Cherokee 

• 22.1 

162.31 

14.5 

.1067 

' .74 

Cowley 

43.0 

224.81 

14.6 

.0764 

.52 

Crawford 

10.8 

122.77 

7.0 

. 0792 

1.13 

Elk 

• 0.0 

0.00 

0.0 

.0000 

.00 

Greenwood 

0.0 

0.00 

0.0 

.0000 

.00 

Labette 

20.4 

156.22 

12.0 

.0922 

.77 

Montgomery 

23.2 

166.20 

13.8 

.0988 

. .72 

Neosho 

10.4 

115.64 

6.8 

.0760 

1.12 

Wilson 

33.5 

187.84 ' 

22.5 

.1263 ' 

.56 

Woodson 

38.1 , 

194.02 

29.2 

.1486 

.51 

Total 

' 242.4 

532.05 

9.1 

.0199 

' .22 

State Total 

4612.9 

3089.32 

26.6 

.0178 

.07 
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5.4 Regression Estimation for Wheat in Areas without 
Lands at Coverage 

Usable Landsat data was not available for the Northeast 
and East Central Crop Reporting Districts; thus those dis- 
tricts were not analyzed. Since estimates of area and pro- • 
portion of wheat in the counties were required, a prediction 
equation was formulated using the 80 counties which had been 
classified with Landsat data. The Landsat wheat estimates 
were written as a function of historical wheat production 
in the two previous years and acres in the county. The 
prediction equation derived by this procedure was: 

y = 10274.97 + 0.66 - 0.26 x 2 - 0.02 * 3 

where x^ is the acreage of wheat grown in a county in 19 74, 
x 2 is the acreage of wheat grown in a county in 1973, x^ 

- c . A* 

is the number of acres_in the county, and y is the "pseudo- 

2 

Landsat" estimate in hectares. TheR value for the 

I 

regression equation was 0.65. 

Regression is good for prediction only when the x values 
corresponding to the estimate to be predicted fall within 
the range of the x values used in deriving the equation. If 
this held true for a given county, the estimate was made from 
the prediction equation. If this did not hold true, the 
USDA/SRS wheat estimate from the 'previous year was used. The 
estimates are presented in Table 17. 


TOO 



Table 17, Regression estimates of area and proportion of 

winter wheat in counties for which usable Landsat 
data was not available. 


County 

Proportion 
SRS Predicted 

00 

. Diff. 

Hectares (000) 

SRS Predicted Diff. 

Northeast District 






*Atchison 

10.3 

7.0 

-3.3 

11.2 

' 7. '7 

- 3.5 

Brown 

10.7 

9.3 

-1.4 

16.0 

14.0 

- 2.0 

*Doniphan 

6.6 

4.5 

-2.1 

6.5 

• 4.4 

- 2.1 

Jackson 

7.9 

7.4 

-0.5 

13.4 

12.6 

- 0.8 

Jefferson 

7.2 

8.7 

1.5 

9.9 

11.9 

2.0 

*Leavenworth 

6.6 

4.3 

-2.3 

7.9 

5.1 

- 2.8 

Marshall 

17.2 

14.4 

-2.8 

40.6. 

34.0 

- 6.6 

Nemaha 

11.9 

10.1 

-1.8 

21.8 • 

18.6 

- 3.2 

Pottawatomie 7.9 

6.2 

-1.7 

16.9 

• 13.3 

- 3.6 

Riley 

9.0 

9.4 

0.4 

14.0 

14.7 

0.7 

*Wyandotte 

2.0 

1.1 

-0.9 

0.8 

0.4 

- 0.4 

Total 

9.9 

8.5 

-1.4 

159.0 

13.6.7 

-22.3 


East Central 
Anderson 

District 

8.5 

7.2 

-1.3 

12.8 

10.7 

- 2.1 

Chase 

4.7 

3.8 

-0.9 

9.5 

7.7 

- 1.8 

Coffey 

7.9 

6.1 

-1.8 

13.4 

10.4 ■ 

- 3.0 

*Douglas 

9.7 

7.2 

-2.5 

11.7 

8.7 

- 3 . U 

Franklin 

8.6 

8.4 

-0.2 

12.9 

12.5 

- 0.4 

*Geary 

11.3 

10.2 

-1.1 

11.7 

10.5 

- 1.2 

*Johnson 

5.0 

3.6 

-1.4 

6.1 

4.4 

- 1.7 

Linn 

5.3 

4.7 

-0.6 

8.4 

7.4 

- 1.0 

Lyon 

8.6 

5.2 

-3.4 

18.9 

11.5 

- 7.4 

Miami 

6.2 

5.7 

-0.5 

9.5 

8.8 

- 0.7 

Morris 

14.0 

13.2 

-0.8 

25.5. 

24.1 

- 1.4 

Osage 

9.2 

7.1 

-2.1 

17.1 

13.1 

- 4.0 

Shawnee 

10.6 

11.7 

1.1 

14.9 

16.3 

1.4 

Wabaunsee 

6.1 

5.0 

-1.1 

12.6 

10.2 

: 2 - 4 

Total 

8.2 

6.9 

-1.3 

185.0 

156.3 

-28.7 


historical estimates used. 
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The estimates obtained were tested for differences from 
SRS estimates of proportion and area of wheat harvested on 
a crop reporting district basis. There were significant 
differences from SRS in both area and proportion estimates 
in both crop reporting districts. Estimation from regres- 
sion consistently underestimated wheat as did the historical 
estimates. Regression seems a reasonable alternative if 
Landsat estimation cannot be done for a given county, but a 
significant decrease in the accuracy of the estimates is 
likely to occur. 
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6.0 CORN AND SOYBEAN IDENTIFICATION AND AREA 
ESTIMATION IN INDIANA 


The second state selected for analysis was Indiana; corn 
and soybeans, the two major grain crops in the state, were 
selected for study. This section includes the results of the 
Landsat data classifications and analyses. As for Kansas, the 
material presented includes a discussion of the factors affect- 
ing classification performance, comparisons of USDA/SRS and 
Landsat estimates of the area and proportions of the crops of 
interest, and evaluations of the accuracy and precision of the 
Landsat estimates. 


6.1 Analysis of Factors Affecting Classification Accuracy 

The effects of several factors likely. to influence the 
accuracy of the Landsat area and proportion estimates were 
investigated. These included: Landsat acquisition date, aerial 
photography acquisition date, and local vs. nonlocal training 
and classification. There are, of course, many additional 
factors such as field size, number of crops and cover types 
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present, uniformity of soils, and production practices, which 
may have also influenced the results , but which were beyond 
the scope of this investigation to pursue. 

6.1.1 Effect of Landsat Acquisition Date 

To study the effect of the date of Landsat coverage on 
the accuracy of the estimates, pairwise comparisons were made 
among three groups of dates (July, August, and September) 
without considering the effect of other factors. Different 
counties were in each group since all counties in Indiana 
were ' classified only once-. The accuracy of an estimate was 
considered to be its closeness to the SRS estimate. 

The estimates of the proportion and area of corn were 
significantly further from "the SRS estimates (a > 0.0-2) 
using September , Landsat data than either July or August data. 
For soybean proportion- and area estimation, the effect of 
Landsat acquisition date was not significant. 

Estimates made from July and August Landsat data were 
not significantly different In accuracy for either com or 
soybeans; thus, either date could be recommended. However, 
the August estimates of both corn and soybeans were closer in 
average difference to the SRS estimates than were the July 
estimates. 'Similar results were -obtained in the 'CITARS 
experiments in which -corn -and soybeans in six Indiana and 
Illinois test sites were classified throughout the growing 
season [ 5 
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6.1.2 Effect of Aerial Photography Acquisition Date 

Three groups of dates (July, August, and September) 
also existed for the aerial photography acquisition dates. 
Although the groups are the same as for the study of Landsat 
acquisition date, the counties within each group were not 
always the same since photographic acquisition was not 
necessarily coordinated with Landsat data acquisition. Con- 
sidering performance as a function of photography acquisition 
date only for corn estimation, both July and August estimates 
were significantly closer to the SRS estimates than September 
estimates were. For soybean estimation, August estimates 
were significantly closer to the SRS estimates than were the 
July estimates, while not significantly closer than September 
estimates . 

Even though there was not a significant difference in the 
accuracy of July and August estimates for corn or of August 
and September estimates for soybeans, the August estimates 
were closer to the SRS estimates in both cases. The best 
time for aerial infrared photography acquisition appears to 
be August, coinciding with the optimal time period for the 
Landsat data acquisition. In some cases, multidate photo- 
graphy proved useful for identifying corn and soybeans when 
individual acquisition dates were not acquired at a good time 
for interpretation. 
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6.1.3 Effect of Local vs. Nonlocal Classification 

The significance of the effect of local versus nonlocal 
classification depended upon the crop being estimated. Corn 

v 

estimates were significantly better in nonlocal counties than 
in local recognition counties; an explanation of this unexpected 
result has not been identified. Soybean classification accu- 
racy was not significantly affected by local versus nonlocal 
classification although local counties were closer to SRS 
estimates on the average. 

6.2 Landsat Classification Results 

The Indiana results include training field classification 
performances, estimates of the area and proportions of corn 
and soybeans for 43 counties in four districts, comparisons of 
the Landsat and USDA/SRS estimates, evaluation of the accuracy 
and precision of the estimates, and regression estimates for 
counties for which Landsat data were not analyzed. 

6.2.1 Classification Accuracy 

Classification accuracy was determined for Indiana by the 
training field performance matrices. No test fields \?ere used 
in Indiana since it was felt that additional training data 
would be more valuable than having test fields; comparison of 
classification accuracies of training and test fields in Kansas 
showed them to be not significantly different. The training 
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field classification performance for all local recognition 
counties is given in Table 18. 

The training field classification performances are typi- 
cally 75 to 85 percent. Although these accuracies are about 
10 percent loiter than obtained in Kansas, they would generally 
be considered adequate for making satisfactory area estimates 
provided a consistent bias was not present. As shall be shown 
in subsequent sections, the area and proportion estimates, 
particularly on a county basis, are not as accurate as might 
have been predicted from the training field classification 
performances. This is believed to be caused by a combination 
of two factors. First, the training performances are for 
"pure” pixels from the centers of fields; the area estimates, 
however, are made from samples including "mixed" or field 
boundary pixels. The proportion of pure pixels for Indiana 
fields which average only about 10 hectares in size is typi- 
cally no more than. 50 percent. Secondly, we encountered some 
difficulty in accurately identifying all fields as corn, 
soybeans, or other. Since positive identification of a field 
was required in. order to use it for training, a significant 
number of fields representing several spectral classes was 
omitted from training. This would cause the training field 
classification performance to be biased upward. 

6.2.2 Classification Bias Correction 

Training field performance matrices were used to calculate 
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Table 18. 

Classification 
Indiana . 

accuracy of 

training fields 

in 



Classification Accuracy (%)' 


County 

Corn 

Soybeans 

Other 

Overall 

Benton 

87.0 

98.1 

72.2 

83.7 

Lake 

79.6 

89.4 

91.5 

85.7 

LaPorte- 

85.0 

97.0 

88. 8 

89.1 

Newton 

86.2 

97.1 

70.0 

84.1 

Pulaski 

92.3 

98.2' 

85.8 

91.6 

Starke 

92-. 3 

9 8. .2 

. 85,. 8 

91.6 

White 

90.9 

89.8 

78.7 

87.5 

Fountain 

88.6 ( 

91.9 

79. '8 

'86.1 

Montgomery 

84.6 

89.8 

81.2 

85.6 

Owen 

87.-2 

64.0 

94.2 

84.1 

Parke 

88.6 

91.9 

-- 79.8 

86.1 

Tippecanoe 

98.3 

90.9 

86.9 

92.5 

Vigo 

61.8 

60.4 

89.6 

75.9 

Warren 

-95.3 

94.4 

92.2 

93.9 

Decatur 

79<. 4 

98.1 

79.1 

85.3 

Grant 

91.8 

98.5 

72.7' 

89.2 

Hamilton 

71.6 

98.0 

76.6 

81.1 

Hancock 

85.1 

99.1 - 

84.8 

90.4 

Howard 

71.6 

98.0 

76.6 

8-1.1 

Johnson 

90.3 

93.7 

94.8 

92.5 

Madison 

88.4 

97.6 

73.3 

88.8 

Shelby 

90.3 

93.7 

'94.8 

92.5 ' 

Tipton 

7L. 6 

98.0 

76. -6 

81.1 

Fayette 

90.5 

90.9 

85.1 

88.5 

Jay 

73.5 

88.5 

81.5 

83.6 

Randolph 

84.4 

95.5 

IS. 9' ' 

87.8 

Wayne 

88.1 

94.7 

82.3 

88.3 
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the bias in the absence of test fields; the Kansas analysis 
had demonstrated this was feasible. Also following the results 
from, the Kansas analysis, error matrices were extended to 
nonlocal recognition counties. 

All crop estimates were corrected for the bias because 
this operation brought them closer to SRS estimates on the 
average. For soybeans, there was no significant difference at 
any reasonable a level in the accuracy of corrected and 
uncorrected estimates. For corn estimates, however, corrected 
estimates were closer to SRS at the 20% significance level. 

6.3 Corn and Soybean Area and Proportion Estimates 

Tables 19 and 20 present the results of the Landsat classi- 
fications on a county -by -county basis. Estimates for both 
proportion and area of corn and soybeans are given as the 
uncorrected and bias -corrected values. The values used in the 
statistical analysis were always the bias-corrected estimates. 

6.3.1 Correlation of Landsat and USDA/SRS Estimates of Area 
and Proportion of Corn and Soybeans 

Plots of the Landsat vs. SRS county estimates of corn and 
soybean area and proportions, along with correlation estimates, 
are shown in Figures 23-26. The two estimates are not as 
highly correlated as the Kansas estimates; three- counties , 
however, accounted for much of the lack of correlation of the 
corn estimates.. The Landsat estimates for corn are 
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Table 19. Uncorrected and bias-corrected Landsat estimates 
of hectares and proportions of corn in Indiana. 


County 

Uncorrected 
Hectares Proportion 

Bias-Corrected 
Hectares Proportion 

coocn 

Northwest District 

(%) 

C000) 

C%) 

Benton 

53.5 

50.5 

53.6 

50.6 

Jasper 

36.8 

25.3 

92.0 

63.3 

Lake 

56.1 

42.1 

62.7 

47.1 

LaPorte 

60.8 

38.6 

64.7 

41.1 

Newton 

63.2 

59.3 

63.0 

59.2 

Porter 

47.2 

42.9 

53.1 

48.2 

Pulaski 

54.0 

48.1 

54.1 

48.2 

Starke 

38.8 

48.2 

38.1 

47.3 

White 

66.6 

51.7 

63.4 

49.2 

Total 477.0 

West Central District 

44.2 

544.7 

50.4 

Clay 

17.1 

18.1 

18.0 

19.1 

Fountain 

45.9 

44.6 

42.2 

41.0 

Montgomery 

■ 60.8 

46.3 

62,2 

47.4 

Owen 

23.2 

23.3 

19.2 

19.2 

Parke 

50.1 ' 

42.9 

44.4 

38.0 

Putnam 

39.8 

31.5 

36.2 

28.6 

Tippecanoe 

56.7 

43.7 

53.0 

40.8 

Vermillion 

34.4 

50.5 

33.5 

49.2 

Vigo 

20.2 

18.8 

21.7 

20.2 

Warren 

38.0 

39.9 

35.9 

37.6 

Total 

386.2 

36.0 

366.3 

34.2 
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Table 19. (continued) 


County 

Uncorrected 
Hectares Proportion 

Bias- 

Hectares 

Corrected 

Proportion 


(000) 

(%) 

(000) 

C%) 

Central District 




Bartholomew 

20.3 

19,5 

3.4 

3.3 

Boone 

19.6 

17.7 

5.6 

5.1 

Clinton 

17.1 

16.2 

2.4 

2.3 

Decatur 

38.5 

40.2 

37.3 

38.9 

Grant 

42.3 

38.8 . 

31.0 

28.4 

Hamilton 

35.8 

34.5 

38.0 

36.6 

Hancock 

29.6 

37.5 

30.6 

38.7 

Hendricks 

41.6 

38.5 

48.2 

44.6 

Howard 

31.8 

41.9 ' 

39.5 

52.0 

Johnson 

32.1 

39.3 

32'. 6 

39.9 

Madison 

51.3 

43.7 

46.7 

39.8 

Marion 

28.5 

27.4 

15.1 

14.5 

Morgan 

19.3 

18.3 

15.3 

14.5 

Rush 

38. 6 

36.4 

38.8 

36.6 

Shelby 

51.6 

48.7 

54.0 

51.0 

Tipton 

26.8 

39.7 

33.7 

49.9 

Total 

524.8 

33.2 

472.2 

29.9 

East Central District 




Blackford 

13.2 

30.4 

15.2 

35.2 

Delaware 

41.8 

40.5 

43.9 

42.6 

Fayette 

15.3 

27. 5 

13.3 

23.8 

Henry 

25.9 

25.0 

23.8 

23.0 

Jay 

27.3 

27.3 

30.9 

30.9 

Randolph 

46.8 

39.5 

49.0 

41.4 

Union 

13.9 

31.9 

12.4 

28.4 

Wayne 

26.5 

25.3 

23.0 

21.9 

Total 

210.7 

31.3 

211.5 

31.4 

State 

1598.7 

36.3 

1594.7 

36.2 
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Table 20. Uncorrected and bias-corrected Landsat estimates 

of hectares and proportions of soybeans in Indiana. 



Uncorrected 

Bias- 

Corrected 

County 

• Hectares 

Proportion 

Hectares 

Proportion 


(000) ■ 

(%) 

(000) 

w 

Northwest District 




Benton 

22. 6 

21.3 

20.3 

19.2 

Jasper 

22.8 

15.7 

22.4 

15.4 

Lake 

24.0 

18.0 

22.1 

16.6 

LaPorte 

32.9 

20.9 

32.9 

20.9 

Newton 

' 13.5 

12.7 

12.4 

11 . 6 

Porter 

22.6 

20.5 

21.4 

19.4 

Pulaski 

32.3 

28.8 

32.6 

29.1 

Starke 

18.3 

22.7 

18.5 

22.9 

White 

27.4 

21.3 

26.4 

20.5 

Total . 

216.4 

20.0 

209.0 

19.3 

West Central District 




Clay 

19.4 

20.6 

26.0 

27.6 

Fountain 

12.7 

12.3 

11.6 

11.3 

Montgomery 

23.1 

17.6 

24.4 

18.6 

Owen 

12.5 

12.5 

15.6 

15.6 

Parke 

11.1 

9.5 

9.3 

8.0 

Putnam 

16.9 

13.4 

21.1 

16.7 

Tippecanoe 

23.9 

18.4 

23.4 

18.0 

Vermillion 

8. 0 

11.8 

7.5 

11.0 

Vigo 

22, 2 

20.6 

29.6 

27.5 

Warren 

11. 5 

12.1 

12.2 

12.8 

Total 

161.3 

15.0 

180.7 

16.9 
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Table 20. (continued) 


County. 

Uncorrected 
Hectares Proportion 

Bias 

Hectares 

-Corrected 

Proportion 


(000) 

(%) 

(000) 

C%) 

Central District 

Bartholomew 

15.7 

15.1 

15.7 

15.1 

Boone 

38.4 

34.7 

38.6 

34.9 

Clinton 

37.0 

35.1 

37.2 

35.3 

Decatur 

15.5 

16.2 

15.6 

16.3. 

Grant 

22.8 

20.9 

21.1 

19.3 

Hamilton 

29.. 7 

28.6 

29.3 

28.2 

Hancock 

23.1 

29.2 

21.8 

27.6 

Hendricks 

■30.7 

28.4 

30.1 

27.9 

Howard 

22.5 

29.6 

22.0 

29..0 

Johnson 

33.3 

40.8 

34.9 

42.8 

Madison 

30.4- 

25.9 

28.1 

23.9 

Marion. 

12.3 

11.8 

11.7 

11.2 

Morgan 

9.8 

-9.3 

11.3 

10. 7 

Rush 

29.8 

28.1 

30'. 9 

29.2 

Shelby 

32.2 

30.4 

33.4 

31.5 

Tipton 

23.5 

34.8 • 

23.3 

34.4 

Total 

406. 7 

25.7 

405.0 

25.6 

East Central District 
Blackford 12.7 

29.3 

11.6 

26.7 

Delaware 

37.3 

36.2. 

- 33.0 

32.0 

Fayette 

12.4 

22.2 

12.3 

22.1 

Henry 

28.6 

27.6 

24.3 

23.4 

Jay 

34.6 

34.6 

33.3 

33.3 

Randolph 

43. 7 

36.9 

38.8 

32.8 

Union 

6.7 

15.3 

6.2 

14.3 

Wayne 

16.5 

15.7 

10.0 

9.5 

Total 

192.5 

28.6 

169.5 

25.2 

State 

976.9’ 

.22/2 

•964.2 

21.9 
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Figure 23. The correlation of Landsat and USDA/SRS 
estimates of the proportion „of corn in- 
Indiana counties. 
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FigttTe 24. The correlation o£ Landsat and USDA/SRS 
estimates of the area of corn in Indiana 
counties . 
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Figure 25, The correlation o£ Landsat and USDA/SRS 
estimates of the proportion of soybeans 
in Indiana counties. 
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Figure 26. The correlation of -Landsat and USDA/SRS 
estimates of the area of soybeans in 
Indiana counties. 
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consistently greater than the SRS estimates. On the other 
hand, the Landsat soybean estimates do not appear biased,, but 
are clearly more variable than either the corn or Kansas wheat 
estimates . 

More quantitative comparisons of the Landsat and SRS 
estimates at the county, as well as the district and ’’state" 
levels, are shown in Tables 21 -and 22. 

6.3.2 Accuracy of Estimates 

Only four .of Indiana's crop reporting districts were 
estimated using Landsat classification methods. These four 
districts together mahe up a "pseudo" state estimate which 
was tested against an SRS "pseudo" state estimate. The 
Landsat corn proportion and area estimates were significantly 
different from 'the SRS estimates. The soybean estimates were 
closer to SRS estimates, but the differences became significant 
at the 20% level for both proportion and area estimates. 
Assuming that the SRS estimates were- unbiased in these crop 
reporting districts, the estimates derived from the Landsat 
classification were, not as accurate- as the SRS estimates.' 

Tests were also performed for differences from SRS esti- 
mates on a crop reporting district basis. In -the Northwest 
and West Central Districts, corn estimates- were significantly 
different from SRS, while soybean estimates- were not signifi- 
cantly different. In the Central District, the reverse was 
found:- corn estimates were not significantly different from 
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Table 21. Comparison of USDA/SRS corn estimates and bias- 

corrected Landsat estimates of area and proportion 
of corn in Indiana. 


County 

Proportion 

(%) 

SRS Landsat Diff. 

SRS 

Hectares 
CO 00) 

Landsat 

Diff. 

Northwest District 






Benton 

44.9 

50.6 

• 5.7 

47.6 

53.6 

6.0 

Jasper 

43. -2 

63.3 

20.1 

62.8 

92.0 

29.2 

Lake 

20.0 

47.1 

27.1 

26.6 

62.7 

36.1 

LaPorte 

30.6 

41.1 

10.5 

48.1 

64.7 

16.6- 

Newton 

44.6 

59.2 

14.6 

47.4 

.63.0 

15.6 

Porter 

24.8 

48.2 

23.4 

27.3 

53.1 

25.8 

Pulaski 

39.4 

48.2 

8.8 

44.2 

54.1 

9.8 

Starke 

35.6 

47.3 

11.7 

28.7 

38.1 

9.4 

White 

41.6 

49.2 

7.6 

53.5 

63.4 

9.8 

Total 

35.8 

50.4 

14.6 

386.2 

544’. 7 

158.5 

West Central 

District 






Clay 

23.1 

19.1 

- 4.0 

21.8 

18.0 

- 3.8 

Fountain 

28.1 

41.0 

12.9 

29.0 

42.2 

13.2 

Montgomery 

39.5 

47.4 

7.9 

51.8 • 

•62.-2 

10.4 

Owen 

7.8 

19.2 

11.4 

7.7 

19.2 

11.5 

Parke 

20.0 

38.0 

18.0 

23.4 

44.4 

21.0 

Putnam 

21.3 

28.6 

7.3 

26.9 

36.2 

9.3 

Tippecanoe 

33.0 

40.8 

7.8 

42.8 

53.0 

10.2 

Vermillion 

20.1 

49.2 

29.1 

13.7 

33.5 

T9. 8 

Vigo 

16.8 

20.2 

3.4 

18.1 

21.7 

3.6 

Warren 

28.4 

37.6 

9.2 

27.0 

35.9 

8.8 

Total 

24.4 

34.2 

9.8 

262.2 

366.3 

104.1 
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Table 21. (continued) 


County 

SRS 

Proportion 

(%) 

Lands at 

Diff. 

SRS 

Hectares 

(000) 

Landsat 

Diff. 

Central District 






Bartholomew 22.8 

3.3 

-19.5 

23.7 

3.4 

-20.3 

Boone 

34.9 

5.1 

-29.8 

38.6 

5.6 

-33.0 

Clinton 

'44.8 

2.3 

-42.5 

47.2 

2.4 

-44.8 

Decatur 

36.9 

38.9 

2.0 

35.3 

. 37.3 

1.9 

Grant 

23.0 

28.4 

5.4 

25.1 

31.0 

5.8 

Hamilton 

30.2 

36.6 • 

6.4 

31.4 

38.0 

6 . 6 

Hancock 

32.5 

38.7 

6.2' 

25.7 

- 30.6 

4.9 

Hendricks 

23.0 

44.6 

21.6 

24.9 

48.2 

23.3 

Howard 

37.3 

52.0 

14.7 . 

28.3 

39.5 

11 . 1 

Johnson 

‘28.5 

39 . 9- 

11.4 

23.3 

32.6 

9.3 

Madison 

30.2 

39.8 

9.6 

35.5 

46.7 

11.2 

Marion 

- 10.8 

14.5 

3.7 

11.3 

* 15.1 

- 3'. 8 

Morgan 

17.9 

14.5 

- 3.4 

18.9 

15.3 

- 3.6 

Rush 

36. 0 

36.6 

0.6 

38.1 

38.8 

0.7 

Shelby 

37. 2 

51.0 

13.8 

39.4 

54.0 

14.7 

Tipton 

40.8 

49.9 

9.1 

27.6 

33.7 

6.1 

Total 

30.0 

29.9 

- 0.1 

474.3 

472.2 

- 2.1 

East Central 

District 





Blackford 

21.5 

■ 35.2 

13.7 

9.3 

15.2 

5.9 

Delaware 

26.4 

42.6 

16.2 

27.2 

43.9 

16.7 

Fayette 

26.0 

23.8 

- 2.2 

14.5 

13.3 

- l.,2 

Henry 

28.3 

23.0 

- 5.3 

29.3 

• 23.8 

- 5.5 

Jay 

16.7 

30.9 

14.2 

16.7 

•30.9 

14.2 

Randolph 

23. 7 

■ 41.4 

17.7 

28.1 

49.0 

21.0 

Union 

31.2 

28.4 

- 2.9 

13.6 

12.4 

- 1.2 

Wayne 

22.5 

21.9 

- 0.6 

23.6 

- 23.0 

- 0.6 

Total 

24.1 

31.4 

.7.3 

162.3 

211.5 

49.2 

State 

29. 2 

36.2 

7.0 

1285.0 

1594.7 

309.7 
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Table 22. Comparison of USDA/SRS soybean estimates and bias- 
corrected Landsat estimates of area and proportion 
of soybeans in Indiana. 


Proportion 

(%) 


Hectares 

( 000 } 


County 

SRS 

Landsat 

Diff . 

SRS 

Landsat 

Diff 

Northwest District 






Benton 

33.6 

19.2 

-14.4 

35.6 

20.3 

-15.2 

Jasper 

21.5 

15.4 

- 6.1 

31.3 

22.4 

- 8.9 

Lake 

10.8 

16.6 

5.8 

14.4 

22.1 

7.7 

LaPorte 

14.3 

20.9 

6.6 

22.5 

32.9 

10.4 

Newton 

21.4 

11.6 

- 9.8 

22.8 

12.4 

-10.4 

Porter 

13.6 

19.4 

5.8 

15.0 

21.4 

6.3 

Pulaski 

25.0 

29.1 

•4.1 

28.0 

32.6 

'4.6 

Starke 

15.9 

22.9 

7.0 

12.8 

18.5 

5.7 

White 

29.8 

20.5 

- 9.3 

. 38.3 

26.4 

-11.9 

Total 

20.4 

19.3 

- 1.1 

220.7 

209. Q 

-11.7 

West Central District 






Clay 

19.5 

27.6 

8.1 

18.4 

26.0 

7.6 

Fountain 

23.0 

11.3 

-11.7 

23.7 

11.6 

-12.1 

Montgomery 

23.1 

18.6 

- 4.5 

30.4 

24.4 

- 5.9 

Owen 

5.9 

15.6 

9.7 

5.9 

15.6 

9.7 

Parke 

14.1 

8.0 

- 6.1 

16.5' 

9.3 

- 7.1 

Putnam 

13.9 

16.7 

2.8 

17.5 

21.1 

3.6 

Tippecanoe 

22.2 

18.0 

- 4.2 

28.9 

23.4 

- 5.5 

Vermillion 

14.9 

11.0 

- 3.9 

10.2 

7.5 

- 2.7 

Vigo 

13.6 

27.5 

13.9 

14.6 

29.6 

15.0 

Warren 

25.9 

12.8 

-13.1 

24.7 

12.2 

-12.5 

Total 

17.8 

16.9 

- 0.9 

190.8 

180.7 

-10.1 
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Table 22. (continued) 


County 

SRS 

Proportion . 

(%) 

Landsat Diff. 

SRS 

Hectares 

(000) 

Landsat 

Diff. 

Central District 
Bartholomew 14.1 

15.1 

1.0 

14.7 

15.7 

, 1.1 

Boone . - 

23.5 

34.9 

11.4 

26.0 

38.6 

1-2.6 

Clinton 

27.3 

35.3 

8,0 

28.8 

37.2 

8.4 

Decatur 

15.1 - 

16.3 

1.2 

14.4 

15.6 

1.2 

Grant 

26.3 

19.3 

- 7.0 

28.7 

-21.1 ' 

- 7.7 

Hamilton 

22.0' 

28.2 

6.2 

22.8 

- 29.3 

■ 6/5 

Hancock 

27.0 

27.6 

0.6 

21.3 

21.8 

0.5 

Hendricks 

19.1 

27.9 

8.8 

• 20.6' 

-30.1 

9.5 

Howard 

27. 8 

29.0 

1.2 

21.1 

22.0 

0.9 

Johnson 

16.7 

42.8 

26.1 

13.6 

34.9 

21.3 

Madison 

24.1 

23.9 

- 0.2 

28.3 

28,1 

- 0.3 

Marion 

8.6 

11, 2 

2.6 

9,0 

- 11.7 

2.7 

Morgan 

11.6 

10.7 

- 0. 9 - - 

12.2 

■ 11.3 

- 0..9 

Rush 

22.1 

29.2 

7.1 

23.4 

■ 30.9 

7.5 

Shelby 

21. 5 

31.5 

-10.0 

22.8 

33.4 

10.6 

Tipton 

29.5 

34.4 

4.9 

20.0 

23.3 

3.3 

Total 

20. 7 

25.6 

4.9 

327.7 

405.0 

77.3 

East Central- District 
Blackford 27.1 

26.7 • 

- 0.4 

11.7 

11.6 

-.0.2 

Delaware 

23.2 

32.0 

8.8 

23.9 

33.0 

9.1 

Fayette 

13.0 

22.1 

9.1 

7.2 

12.3 

5.1 

Henry 

20.4 

23.4 

3.0 

21.1 

24.3 

3 . 1 

Jay 

26 . 9 

33.3 

6.4 

26.9 

33.3 

-6 . 4 

Randolph 

28.1 

32.8 

4.7 

33.3 

38.8 

5.5 

Union 

13.7 

14.3 

0.6 

6.0 

' 6.2 

0.3 

Wayne 

13, 5 

9,5 

- 4.0 

14.2' 

10.0 

- 4.2 

Total 

21.5 

25.2 

3, 7 

144.3 

' 169.5 

25.2 

State 

20.1 

21.9 

1.8 

88 3 . 5 

964.2 

80.7 
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SRS while soybean estimates were different. In the East 
Central District, both corn and soybean estimates differed 
significantly from SRS estimates at the 25% level. 

In conclusion, compared to SRS, the Landsat estimates of 
corn area and proportion were consistently overestimated. This 
is attributed in part to the spectral similarity of corn to 
other cover types, particularly trees, as well as to factors 
mentioned earlier such as boundary pixels. Because the corn 
estimates, although biased, were correlated with the SRS esti- 
mates, a regression technique such as described by Wigton [26] 
might be effectively used if sufficient "ground truth" data 
were available to determine the magnitude of the bias. On the 
other hand, the large variation present in soybean estimates 
would make it infeasible to attempt such a correction. When 
aggregated, however, the soybean estimates were reasonably close 
to the SRS estimates. 

.One further factor, perhaps accounting for some of the 
differences in the Landsat and SRS estimates, is that the SRS 
county and district estimates used for comparison are prelim- 
inary and may be revised before the final estimates are pub- 
lished in 1977. This possibility was identified when 1974 
estimates were examined for use in regression equations to 
predict crop areas in counties for which Landsat data were 
not analyzed. 

In November 1976, revised 1974 county estimates of corn 
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and soybean acreages were published by SRS. At first glance, 
these estimates seemed to be different from the preliminary 
estimates. For prediction of crop acreages where historical 
data was used (either as an estimate or in a regression} the 
preliminary figures were used to simulate real-time estimation. 
However, in a test on a few counties, a regression equation 
using the revised estimates appeared to give better prediction - 
for' 1975. 

The Landsat estimates for corn and soybeans did differ 
from the available SRS estimates which were preliminary. 

Looking at the changes in the 1974 estimates, it seems possible 
that the SRS revised estimates may be enough different from the 
estimates used for comparison that the Landsat estimates may 
not differ (at least not so much)- when compared to the revised 
figures. It is unfortunate, however, that the revised 1975 
estimates will not be available until late in 1977. 

To evaluate the difference between the preliminary and 
revised estimates on a county basis, the relative difference 
of the preliminary estimate from the revised estimate was cal- 
culated. These are presented for each crop and each county in 
Table 23. Relative differences were as great as 33.3%. This 
extreme figure- occurred in a county with a very small corn and 
soybean production, but other large relative differences of 
10 to 20% occurred where these crops were more important. The 
differences in hectares of the preliminary from the revised 
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estimates are also given in Table 23. Some estimates have 
changed by as much as 4000 hectares. 

6.3.3 Precision of Estimates 

The variance of the corn and soybean estimates can be cal- 

A 

culated from the binomial assumptions. If p c represents the 


bias -corrected estimate of proportion corn in a county and p s 
represents the bias-corrected estimate of proportion soybeans 
in a county, then 


V Cp c ) 



n-1 


(1-f) and 


fi 

v (P s ) 



n-1 


d-f), 


where n is the number of pixels classified in the county and 
f=^ where N is the total number of pixels in the county. 

TheSRS sampling error is not known, but the sampling 
error of Landsat estimates is very small in comparison as it is 
very small absolutely. Sample standard deviations and coeffi- 
cients of variation for Landsat estimates are presented in 
Tables 24 and 25. The standard deviations for the crop 
reporting districts and for the state were calculated consid- 
ering the sample as stratified with each county considered a 
stratum. As in Kansas, the sampling error of the state, 
district, and covin ty crop area estimates is very small. 
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Table 23. Differences of USDA/SRS preliminary 1974 estimates 
from revised estimates. 



Relative 

Difference 


Difference 

in 


(%) 

Hectares 

County 

Corn Soybeans 

Corn Soybeans 


Northwest District 


Benton 

-4.7 

6.0 

-2145.7 

2267.2 

Jasper 

-5.0 

4.4 

-3238.9 

1457.5 

Lake 

-4.2 

6.0 

-1133.6 

931.2 

XaPorte 

-0.1 

-3.8 

-40.5 

-890.7 

Newton 

-5.1 

-3.5 

-2388.7 

-850.2 

Porter 

-1.0 

-3.1 

-283.4 

-485.8 

Pulaski 

1.0 

A. 7 

404.9 

1417.0 

Starke 

0.4 

9.8 

121.5 

1295.5 

White 

-2.6 

4.0 

-1376.5 

1578.9 

•th Central District 
Carroll 

-0.9 

2.5 

-404.9 

566.8 

Cass 

• -2.8 

6.4 

-1052.6 

1417.0 

Elkhart 

5.8 

-3.2 

1619.4 

-445.3 

Fulton 

-1.0 

5.1 

-283.4 

931.2 

Kosciusko 

-2.9 

-4.0 

-1174.1 

-850.2 

Marshall 

3.8. 

'-5.4 

1295.5 

-1012.1 

Miami 

3.2 

-6.2 

1012.1 

-1214.6 

St. Joseph 

2.7 

-6.9 ’ 

769.2 

-1012.1 

Wabash 

-0.9 

-7.6 

-283.4 

-1700.4 


Northeast District 
Adams 
Allen 

DeKalb 

Huntington 

LaGrange 

Noble 

Steuben 

Wells 

Whitley 


2.4 

-8.1 

-3.2 

-2.3 

6.4 

13.3 

-1.0 

5.0 

-1.0 

-7.6 

-0.9 

^3.2 

6.0 

13.6 

2.1 

0.7 

-0.9 

7.3 


566.8 -2267.2 

-1012.1 -890.7 

1093.1 2510.1 

-242.9 1417.0 

-202.4 -485.8 

-242.9 -404.9 

1012.1 850.2 

566.8 242.9 

-202.4 1336.0 
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Table 23. (continued) 


County 


Relative 

Difference 

(%) 

Corn Soybeans 


Difference 

in 

Hectares 

Corn Soybeans 


West Central District 


Clay 

-9.2 

-15.4 

-1740.9 

-2955.5 

Fountain 

4.5 

-1.9 

1336. 0 

4 -485.8 

Montgomery 

-IvO 

-7.7 

-485. 8 

-2550.6 

Owen 

17.1 

6.9 

1295.5 

445.3 

Parke 

4.4 

5.4 

1012,1 

. 931.2 

Putnam 

-6.8 

0.6 

-1619.4 

121.5 

Tippecanoe 

-1.0 

-4.9 

-404.9 

-1538.5 

Vermillion 

24.2 

11.6 

3279.4 

1295.5 

Vigo 

6.2 

0.7 

1052.6 

121.5 

Warren 

6.4 

0.6 

1781.4 

161.9 


Central District 
Bartholomew 

1.-8 

-1.5 

445.3 

-242.9 

Boone 

10.3 

-4.0 

3684.2 

-1133.6 

Clinton 

-0.9 

-0.6 

-404.9 

-202.4 

Decatur 

2.5 

0.7 

890.7 

121.5 

Grant 

0.6 

-6.7 

161.9 

-1943.3 

Hamilton 

-1.0 ' 

-8.2 

-2*83.4 

-2064.8 

Hancock 

-0.9 

-0.7 

-242.9 

-161.9 

Hendricks 

2.7 

-3.3 

647.8 

-769.2 

Howard 

-7.1 

10.1 

-1862.3 

2186.2 

Johnson 

. 5.9 

-0.8 

1376.5 

-121.5 

Madison 

-4.6 

-13.4 

-1619.4 

-4048.6 

Marion 

2.4 

5,0 

283.4 

485.8 

Morgan 

-0.9 

, 9.7 

-161.9 

1295.5 

Rush 

1.1 

0.7 

445.3 

161.9 

Shelby 

-4.8 

0.7 

-1902. .8 

161.9 

Tipton 

• 5.3 

■8.0 

1498.0 

1781.4 

East Central District 
Blackford 

3.3 

0.6 

323.9 

81.0 

Delaware 

-0.9 

-3.0 

-242.9 

-769.2 

Fayette 

-0.9 

0.5 

-121.5 

40.5 

Henry 

-8.4 

-2.7 

-2469.6' 

-607.3 

Jay 

14.0 

2.1 

2388.7 

607.3 

Randolph 

1.8 

2.9 

526.3 

1052.6 

Union 

-0.9 

12.4 

-121.5 

850.2 

Wavn o 

■ 2.5 

10.4 

566.8 

1740.9 
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Table 23. (continued). 


Relative Difference 

Difference in 

(%) Hectares . 


County 

■Corn 

Soybeans 

Corn. 

Soybeans 

Southwest District 





Daviess 

3.1 

-2.0 

931.2 

-283.4 

Dubois 

2.8 

0.7 

607.3 

40.5 

Gibson 

-1.0 

6.2 

-404.9 

1376.5 

Greene 

-2.2 

-6.5 

-404.9' 

-68*8.3 

Knox 

7.9 

-1.3 

3967.6 

-283.4 

Martin 

-1.1 

22.2 

-81.0 

404.9 

Pike 

-0.8 

9.9 

-121.5 

890.7 

Posey 

4.1 

4.6 

1295.5 

971.7 

Spencer 

-10.6 

3.0 

-1578.9 

526.3 

Sullivan 

2.7 

7.3 

607.3 

1336.0 

Vanderburgh 

8.2 

-1.7 

1093.1 

-202.4 

Warrick 

-3.8 

-13.9 

-5-26.3 

-1700.4 


South Central District 
Brown 

0.0 

-33.3 

0.0 

-161.9 

Crawford 

0.0 

8.3 

0.0 

81.0 

Floyd 

0-.0 

30.0 

0.0 

242.9 

Harrison 

-16.9 

1.0 

-1457.5 

40.5 

Jackson 

4.0 

12.2 

971.7 

1700.4 

Lawrence 

-0.9 

24.1 

-81.0 

850.2 

Monroe 

-1.1 

10.6- 

-40.5 

202.4 

Orange 

-12.6 

1.2 

-1174.1 

40.5 

Perry 

-6.6 

1.4 

-242.9 

40.5 

Washington 

-23.6 

0.7 

-4048.6 

40.5 


Southeast District 
Clark 
Dearborn 
Franklin 
Jefferson 
Jennings 
Ohio 
Ripley 
Scott 

Switzerland 


-3.3 

0.6 

- + 1 8 • 2 

-15.7 

-7.7 

5.9 

-2.9 

-11.4 

11.5 

8.4 

-1.8 

-17.6 

-0.9 

■ 12*. 0 

-0.8 

25.0 

-1.4 

0.0 


-242.9' 

40.5 

-890.7 

-445.3 

-12-95.5 

• 445'. 3 

-202.4 

-890.7 

1498.0 

890.7 

-40.5 

-121.5 

-121.5 

1700.4 

-40.5 

1255.1 

-40.5 

0.0 
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Table 24. Estimates of the standard deviations and coefficients 
of variation of Landsat estimates of corn in Indiana. 


AREA ESTIMATE- 


STANDARD 

HECTARES DEVIATION 


PROPORTION ESTIMATE 


(%) 


STANDARD 

DEVIATION 


COEEFICIENT 

OF 

VARIATION 


(000 HA) 


(HA) 


m 


(%) 


NORTHWEST DISTRICT 
BENTON 
JASPER 
LAKE' 

LAPORTE 

NEWTON 

PORTER 

PULASKI 

STARKE 

WHITE 

TOTAL- 


53.6 

92.0 

62.7 

64.7 

63.0 

53.1 

54.1 

38.1 
63.4 


195.97 
499.30 
477.08 
510.06 
467.53 
428.55' 
435.87 
352., 25 
208.11 


544.7 1239.02 


50.6 

63.3 

47.1 
41-.1 

59.2 

48.2 

48.2 

47.3 
49.2 

50.4 


0.1849 
0.3435 
0.3582 
0 . 3238 
0.4390' 
0.3892 
0.3885 
0.4371 
0.1616 

0.1147 


0,37 

0.54 

0.76 

0.79 

0.74 

0.81 

0.81 

0.92 

0,33 

0.23 


WEST CENTRAL OISTRICT 



CLAY 
FOUNTAIN 
MONTGOMERY 
OWEN 
PARKE 
PUTNAM 
TIPPECANOE 
VERMILLION 
VIGO 
WARREN 

TOTAL 


CENTRAL DISTRICT 
BARTHOLOMEW . 
BOONE 
CLINTON 
DECATUR 
GRANT 
HAMILTON 
HANCOCK 
HENDRICKS 
HOWARD 
JOHNSON' 
MADISON 
MARION ' 
MORGAN 
RUSH 
SHELBY 
TIPTON 


18 

42 

62 

19 

44 

36 

53 

33 

21 

35 


.0 

.2 

.2 

.2 

.4 

.2 

.0 

.5 

.7 

.9 


3.4 
5,6 

2.4 
37.3 

31.0 

38.0 
30.6 

48.2 

39.5 

32.6 

46.7 

15.1 

15.3 

38.8 
54.0 
33.7 


233.84 

423.08 
471.14 

379.55 
592.32 

451.09 

200.56 

342.09 
342.62 
196.02 


366.3 1211.80 


153.59 
191.23 

127.60 
.397.20 
177.28 
405.14 

154.32 
432.73 

361.32 
365.05 
191.20 
424.45 
298. 40 
400.08 
421.18 
341.94 


TOTAL 


472.2 1289.24 


19.1 

41.0 
47.4 

19.2 

38.0 
28.6 
40.8 

49.2 

20.2 

37.6 

34.2 


3.3 
5.1 

2.3 

38.9 

28.4 

36.6 

38.7 
44.6 

52.0 

39.9 

39.8 

14.5 

14.5 

36.6 

51.0 

49.9 

29.9 


0.2479 

0.4113 

0.3588 

0.3805 

0.5069 

0.3567 

0.1545 

0.5020 

0.3186 

0.2056 

0.1130 


0.1474 

0.1728 

0.1210 

0.4147 

0.1625 

0.3899 

0.1953' 

0.4005 

0.4759 

0.4473 

0.1629 

0.4075 

0.2837 

0.3775 

0.3974 

0.5056 

0.0816 


1.30 

1.00 

0.76 

1.98 

1.33 

1.25 

0.36 

1.02 

1.58 

0.55 

0.33 


4.47 

3.39 

5.26 

1.07 

0.57 

1.07 

0.50 

0.90 

0.92 

1.12 

0.41 

2.81 

1.96 

1.03 

0.78 

1.01 

0.27 


EAST CENTRAL DISTRICT 


BLACKFORD 

DELAWARE 

FAYETTE 

HENRY 

JAY 

RANDOLPH 

UNION 

WAYNE 

TOTAL 


15.2 
43.9 

13.3 

23.8 

30.9 

49.0 

12.4 

23.0 


260,39 

720.23 

401.80 
354.60 
174.15 
202.96 

191.81 
160.47 


211.5 1003.60 


35.2 

42.6 

23.8 
23.0 

30.9 

41.4 

28.4 

21.9 

31.4 


0.6018 

0.6984 

0.7213 

0.3421 

0.1741 

0.1714 

0.4406 

0.1529 

0.1492 


1.71 

1.64 

3.03 

1.49 

0.56 

0.41 

1.55 

0.70 

0.48 


STATE TOTAL 


1594.7 2383.23 


36.2 


0.0541 


0.15 
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Table 25. Estimates of the standard deviations and coefficients of 
variation of Landsat estimates of soybeans in Indiana. 


“AREA estimate 


PROPORTION ESTIMATE 


STANDARD 

HECTARES DEVIATION 


(000 HA) 


(HA) 


- COEFFICIENT 
(%) STANDARD 'OF 

DEVIATION VARIATION 


(%> 


<*> 


NORTHWEST DISTRICT 
BENTON 20 

JASPER 22 

LAKE 22 

LAPORTE 32 

NEWTON 12 

PORTER 21 

PULASKI 32 

STARKE 18 

WHITE 26 


.3 

.4 

.1 

.9 

.4 

♦ 4 
,6 
.5 

• 4 


TOTAL 209.0 


154,39; 

373.92' 

355.62; 

421.51 

304.63 

33.9,14 

396.22 

296.46 

168,05 

974.36 


19.2 

15.4 
16.6 
20i9 

m 

29.1 

22.9 

20.5 

19.3 


0.1457 

0.2572 

0.267-0 

0.2676 

0.2861 

0.3080 

0.3532 

0.3679 

0.1305 

0.0902 


0.76 

1.67 

1.61 

1.28 

2.47 

1.59 

1.21 

1.61 

0.64 

0.47 


WEST CENTRAL DISTRICT 




CLAY 
FOUNTAIN 
MONTGOMERY 
OWEN 
PARKE 
PUTNAM . 
TIPPECANOE 
VERMILLION 
VIGO 
WARREN 

TOTAL 


26 

11 

24 

15 

9 

21 

23 

7 

29 

12 


.0 

.6 

.3 

.1 

.4 

.5 

.6 

.2 


180.7 


265.92 

272.34 

367.15 

349.66 

331.06 

372.32 

156.78 

214.10 

381.05 

135,20 

9*0.49 


27.6 
11,3 

18.6 
15.6. 

8.0 

16.7 
18.0 

H:8 

12.8 

16.9 


0,2820 

0.2647 . 

0,2796 

0.3505 

0.2833 

0.2944 

0.1208 

0.3142 

0.3544 

0.1418 

0.0877. 


1 • 02 r ‘ 
2.34 
1.50 
2.25 
3,54- 
1.76 
0.67 
2.86 
1.29 
1.11 

0.52 


CENTRAL DISTRICT 
BARTHOLOMEW 
BOONE 

CLINTON • 

DECATUR 

GRANT 

HAMILTON 

HANCOCK 

HENDRICKS 

HOWARD 

JOHNSON 

MADISON 

MARION 

MORGAN 

RUSH 

SHELBY 

TIPTON 

TOTAL 


1 


5.7 

8.6 

37.2 

15.6 

'liti 

21.8 

30.1 

22.0 

34.9 
.28,1 

11.7 

11.3 

30.9 

33.4 
23.3 


307.;84 

414.32 

■406.79 

300.93 

155.15 

378.45 
141.64 

390.45 

328.17 
368.85 
166.59 

380.17 
261.96 
377.63 
391.37 
324.87 


405.0 1320.84 


m 

35.3 

16.3 

19.3 
28.2 

27.6 
27.9 
29.0 

42.8 

23.9 

11.2 

10.7 
29.2 

31.5 

34.4 

25.6 


0,2955 

0.3745 

0.3857 

0.3142 

0,1792 
0.3614 
0.4323 
0.4519 
0.1419 
0 .,3850 
0.24.90 
0.3563 
0.3693 
0.4804 

0.0836 


1.96 

1.07 

1.09 

1.93 

0.74 

1.29 
0,65 

1.30 
1,49 
1.06 
0.59 
3.26 
2,33 
1.22 
1.17 
1.40 

0.33 


EAST central district 


BLACKFORD 

DELAWARE 

FAYETTE 

HENRY 

JAY 

RANDOLPH 

UNION 

WAYNE 

TOTAL 


11.6 

33.0 

12.3 

24.3 

33.3 
38.8 

6.2 

10.0 

169.5 


241.19 

679.42 

391.48 

356.74 

177.62 

193.46 

148.90 

113.77 

951.14 


26.7 

32.0 

22.1 

23.4 

33.3 
32.6 

14.3 
9.5 

25.2 


0.5574 

0.6508 

0.7027 

0,3442 

0.1776 

0.1634 

0.3421 

0.1084 

0.1414 


2,09 

2.06 

3.18 

1.47 

0.53 

0.50 

2.39 

1.14 

0.5fo 


STATE TOTAL 


964.2 2118.91 


21.9 


0.0481 


0 , 22 ' 
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6.4 Regression Estimation for Corn and Soybeans in A-reas 

Without Landsat Coverage 

Landsat data was not analyzed due primarily to cloudiness 
for five districts in Indiana: North Central, Northeast, 

Southwest, South Central, and Southeast. Since estimates of 
the area and proportion of corn and soybeans in these counties 
were required, a prediction equation was developed for each 
crop using the 43 counties which had been classified with 
Landsat data. The Landsat estimates were written as a function 
of historical crop production in the two previous years, and 
acres in the county. These equations were then used to predict 
area and proportion estimates for corn and soybeans in the 
counties which did not have Landsat coverage. 

To estimate the area of corn, the counties classified in 
Indiana were divided into three groups according to the USDA/SRS 
1974 preliminary estimates of acreage of corn (Table 26) . The 
rationale for dividing the counties into groups was to make the 
variances more homogeneous within groups. A prediction equation 
was formulated for each of the groups using the variables: 
acres in the county, the 1973 SRS revised estimate and the 1974 
SRS preliminary estimates of acres of corn harvested in the 
county. The counties in which the area of corn was to be pre- 
dicted fell into one of these three groups according- to the 
same criterion; however, if the number of acres in the county 
or the 1973 or 1974 corn acreage estimate fell outside the 
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Table 2'6. Groupings used for regression- estimation and 
the number of counties per group. 


Group 

Counties 
Lands at 

with 

del t cl 

Counties 
to be 
predicted 

USDA/SRS 1974 
preliminary 
acreage estimates - 



For 

Corn Estimation 

1 

10 


8 

<50,000 acres 

2 

21 


13 

50-90,000 acres 

3 

.12 


3 

>90,000 acres 



For 

Soybean Estimation 

1 

12 


12 

<40,000 acres 

2 

14 


14 

40-60 , 000 . acres 

3 

17 


2 

>60,000 acres 
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appropriate range, historical estimation was used. For 26 
counties, historical estimates were used. 

The prediction equations found are given as follows: for 

the first group, 

y = 3.98 + 0.01 X! - 0.46 x 2 + 0.81 x 3 (R 2 = 0.31); 

for the second group, 

^ 2 
y = - 19.33 + 0.10 Xi + 1.22 x 2 - 0.67 x 3 (R = 0.30); 

for the third group, 

2 

y = - 69.36 + 0.17 x x - 1.80 x 2 + 2.33 x 3 CR = 0.49) 

where x 3 is the number of thousands of acres in the county, x 2 

is the acreage of corn grown in a county in 1973 in thousands, 

and x 3 is the acreage of corn grown in a county in 1974 in 

/\ 

thousands. The "pseudo” Landsat estimate, y, is given in 
thousands of hectares. 

For soybean-estimation, the counties were again divided 
into three groups, but this time the groupings were based upon 
the USDA/SRS 1974 preliminary soybean estimates (Table 26), 

For 21 counties, historical estimation was used. The predic- 
tion equations found are given as follows:, for the first group, 

^ “ 2 
y * - 2.08- + 0.02 xi + 0.25 x 2 + 0.17 x 3 (R = 0.32); 

for the second group, 
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2 


0 . 20 ) 


y " - 6.71 + 0.04 x i + 0.33 x 2 (R = 

(the variable X 3 did not add sufficient 
information to enter the regression) ; 

and for the third group, 

y = 29.87 - 0.03 xi - 0.19 x 2 + 0.27 x 3 (R 2 = 0.02); 

where x 2 is the number of thousands of acres in the county, x ' 2 
is the acreage of soybeans grown in a given county in 1973 in 
thousands, and x 3 is the acreage of soybeans grown in a county 

A 

in 1974 in thousands. The ''pseudo ' 1 Landsat estimate, y, is 
given in thousands of hectares. Estimates were then made using 
these six equations and historical data (Tables 27 and 28). 

The estimates made by the prediction equations were gen- 
erally not of as high an accuracy as the SRS estimates. Esti- 
mates of corn area and proportion were not significantly - 
different from SRS estimates' at the' 25% level in the Northeast 
and Southeast Districts. In all other districts, however, and 
for soybean area and proportion estimates in all districts, the 
regression estimates were significantly different from those 
obtained by SRS. 
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Table 27. Regression estimates of area and proportion of 
corn in counties for which usable Landsat data 
was not available. 


Hectares Proportion 

( 000 ) (%) 

County * SRS Reg. Diff. SRS Reg. . Diff. 


North Central District 


Carroll 

H 

44.2 

43.4 

- 0.8 

45.6 

44.8 

- 0.8 

Cass 

H 

38.7 

37.0 

- 1.7 

36.0 

34.4 

- 1.6 

Elkhart 

2 

29.8 

42.2 

12.4 

24,6 

34.8 

10.2 

Fulton 

2 

31.5 

36.6 

5.1 

33.1 

38.5 

5.4 

Koscuisko 

3 

43.7 

37.7 

- 6.0 

32.3 

27.9 

- 4.4 

Marshall 

2 

35.5 

44.1 

8.6 

30.9 

38.3 

7.4 

Miami 

2 

33.2 

36.8 

3.6 

33.7 

37.4 

3.7 

St. Joseph 

2 

28.9 

37.5 

8.6 

23.9 

31.0 

7.1 

Wabash 

2 

33.4 

43.7 

10.3 

30.6 

40.1 

9.5 

Total 


318.9 

359.0 

40.1. 

31.9 

35.9 

4.0 

Northeast District 


* 





Adams 

2 

23.0 

23.4 

0.4 

25.7 

26.2 

0.5 

Allen 

H 

34.6 

30.6 

- 4.0 

19.9 

' 17.. 6 

- 2.3 

DeKalb 

1 

18.6 

22.6 

4.0 

19.7 

23.9 

4.2 

Huntington 

2 

23.5 

28.4 

4.9 

23.3 

28.1 

4.8 

Lagrange 

H 

-25.5 

20.8 

- 4.7 

26.0 

21.2 

- 4.8 

Noble 

2 

27.1 

30.8 

3. T 

25.5 

29.0 

3. '5 

Steuben 

1 

17.5 

23.1 

5.6 

21.8 

' 28.8 

7.0 

Wells 

2 

25.9 

27.5 

.1,6 

27.2 

28.8 

1.6 

Whitley 

H 

22.6 

21.3 

- 1.3 

26.0 

24.5 

- 1.5 

Total 


218.3 

228.5 

10.2 . 

23.6 

24.7 

1.1 
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Table 27. (continued) 


Hectares ' Proportion 

( 000 ) ; (%) 


County 

* 

SRS 

Reg. 

Diff. 

SRS 

Reg. 

Diff. 

Southwest District 
Daviess 2 

30.7 

39.4 

8.7 

27.5 

•35;3 

. 7.8 

Dubois 

H 

23.2 

22.3 

- 0.9 

20.7 

19.9 

- 0.8 

Gibson 

3 

43.1 

42.0 

- 1.1 

33.3 

■ 32.5 

- 0.8 

Greene 

H 

21.2 

18.3 

- 2.9 

14.9 

12.9 

-.2.0 

Knox 

3 

52.0 

86.7 

34.7 

38.8 

64.7 

25.9 

Martin 

H 

8.7 

7.5 

- 1.2 

9.7 

8.4 

- 1.3 

Pike 

1 

15.1 

19.5 

4.4 

17.4 

22.5 

'5.1 

Posey 

2 

33.2 

38.5 

5.3 

31.0 

35.9 

4.9 

Sp encer 

1- 

18.8 

17.4 

- -1.4 

18.3 

17.0 

- 1.3 

Sullivan 

2 

23.9 

39.2 

15.3 

20.2 

33.1 

12.9 

Vanderburgh 

1 

13.8 

20.2 

6.4 

22.1 

32.4 

10.3 

Warrick 

1 

14.9 

19.1 

4.2 

14,7 

18.9 

4 . 2 

Total 


298.6 

370.1 

71.5 

23.0- 

28.5- 

• 5.5 

South Central 
Brown 

District 
H 1.2 

1.2 

O.'O 

1.4 

- 1.4 

' 0.0 

Crawford' • 

•H— 

- 2.1 

1.9 

- 0.2 

-2.6 

2.-4 

- 0.2 

Floyd 

H 

1.4 

1.3 

- 0.1 

3.6 

3.4 

- 0.2 

Harrison 

H 

8.3 

7.2 

- 1.1 

6.7 

5,8 

- 0.9 

Jackson . 

H 

■27.0 

25.3 

- 1.-7 

20.0 

18.8 

- 1.2 

Lawrence 

H 

9.7 

9.2 

- 0.5 

8.2 

7.7 

- 0.5 

Monroe 

H 

3.7 

3,6 

- 0.1 

3.7 

3.6 

- ' 0 . 1 

Orange 

H' 

10. 1 

'8.2 

- 1/9 

9.6 

7.8 

- 1.8 

Perry 

H 

4.4 

3.4 

- 1.0 

4.. 4 

3.4 

- 1.0 

Washington 

H' 

1-8.1 

13.1 

- 5.0 

13.5 

9, 8 

-3.7 

Total 


'8'6-. 0 

74,4 

-11, 6 

8.4 

7.3 . 

- 1.1 
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Table 27. (continued) 


County 

* 

SRS 

Hectares 

(000) 

Reg. Diff. 

Proportion 

(*) 

SRS Reg. Diff. 

Southeast District 




x 



Clark 

H 

7.4 

7.1 

- 0.3 

7.4 

’ 7.1 

- 

0.3 

Dearborn 

H 

5.2 

4.0 

- 1.2 

6 . 6 

5.0 

- 

1.6 

Franklin 

1 

16.8 

20.9 

4.1 

16.5 

20.5 


4.0 

Jefferson 

H 

7.7 

6.9 

- 0.2 

8.1 

7.3 

- 

0.8 

Jennings 

1 

12.5 

21.6 

9*1 

12.8 

22.1 


9.3 

Ohio 

H 

• 2.0 

2.2 

0.2 

8.9 

9.8 


0.9 

Ripley 

H ' 

12.8 

12.9 

0.1 

11.2 

11.3 


0.1 

Scott 

H 

4.9 

4.7 

- 0.2 

9.8 

9.4 

“ 

0.4 

Switzerland H 

3.1 

2.8 - 

- 0.3 

5.4 

4.9 

“ 

0.5 

Total 


72.4 

83.1 

10.7 

10.1 

. 11.6 


1.5 


^Method of Estimation: H-historical ; 1, 2, and 3 refer to the 
groups defined-in Table 26. 




Table 28. Regression estimates of area and proportion of 
soybeans in counties for which usable Landsat 
data was not available. 


County 

* 

SRS 

Hectares 

(000) 

Reg. 

Diff . 

Proportion 
(%) . 

SRS Reg. Diff. 

North Central 

District 






Carroll 

2 

21.7 

24.8 

3.1 

22.4- 

25. 6 

3.2 

Cass 

2 

20.5 

23.5 

3.0 

19.1 

21.9 

2.8 

Elkhart 

1 

14.0 

21.0 

7.0 

11.5 

17.3 

5.8 

•Fulton 

2 

16.9 

20.3 

3.4 

17.8 

21.3 

3.5 

Koscuisko 

2 

21.1 

24.4 

3.3 

15.6 

18.0 

.2.4 

Marshall 

2 

17*3 

21.0 

3.7 

15.0 

18.3 

3.3 

Miami 

2 

18.3 

20.7 

2.4 

18.6 

21.0 - 

2.-4 

St. Joseph 

1 

14.3 

20.5 

6.2 

11.8 

16.9 

5.1 

Wabash 

2 

21, 8 

23.0 

1.2 

20.0 

21.1 • ' 

1.1 

Total ' 


165.9- 

199.2 

33.3’ 

16.6 

19.9 

3; 3 

Northeast District 







Adams 

H 

26.7 

25.8 

- 0.9 

29.9 

28.9 ' - 

1.0 

Alien 

■ H - 

' 34. .8 

37.3 

2.5 

20.0 

21.5 

1.5 

DeKalb 

2 

16,8 

20.5 

3.7 

17.8 

21.7 

3.9 

Huntington 

3 

27,8 

-28.9 

1.1 

27.5 

28.6 

1.1 

Lagrange 

H 

5.5 

5.9 

0.4 

5.6 

- 6.0 

0.4 

Noble 

1 

12 . 0 

17.6 

5,6 

11.3 

16. 6 

b . 3 

Steuben 

H 

5.3 

7.1 

. 1.8 

6, 6 

8.8 

2.2 

Wells 

•3 

31.9 

29.7 

- 2. ,2 

33.5 

31.2 

2.3 

Whitley 

2 

17, 2 

18.1 

0.9 

19.8 

20.-8 

1.0 

Total 


178,0 

190.9 

1-2.9 

19.2 

20.6. 

1.4 


138 


Table 28. (continued) 


Hectares Proportion 

( 000 ) (%) 


County 

* 

SRS 

Reg. 

Diff . 

SRS 

Reg. 

Diff. 

Southwest District 
Daviess 1 

12.5 

19.1 

6.6 

11.2 

17.1 

5.9 

Dubois 

H 

5.3 

5.8 

0.5 

4.7 

5.2 

. 0.5 

Gibson 

2 

20.0 

25.9 

5.9 

15.5 

20.0 ' 

4.5 

Greene 

1 

10.0 

17.0 

7.0 

7.0 

12.0 

5.0 

Knox 

2 

20.0 

25.3 

5.3 

14.9 

18.9 

4.0 

■ Martin 

H 

1.7 

2.2 

0.5 

1.9 

2.5 

0.6 

Pike 

1 

8.2 

13.0 

4.8 

9.4 

15.0 

5.6 

Posey 

2 

19.1 

23.2 

4.1 

17.8 

21.6 

3.8 

Spencer 

2 

17.0 

20.1 

3.1 

16.6 

• 19'. 6 

3.0 

Sullivan 

2 

16.4 

22.3 

5.9 

13.8 

18.8 

5 . 0 

Vanderburgh 

1 

10.8 

14.7 

3.9 

17.3 

23.5 

6.2 

Warrick 

1 

11.7 

15.6 

3.9 

11.6 

15.4 

3 . 8 

Total 


152.7 

204.2 

51.5 

11.8 

■ 15.7 

3.9 

South Central 
Brown 

District 
H ' 0.4 

0.3 

- 0.1 

0.5 

0.4 

- 0.1 

Crawford 

H ■ 

0.9 

1.1 

0.2 

1.1 

1.4 

0.3 

Floyd 

H 

0.8 

1.1 

0.3 

2.1 

2.9 

0.8 

Harrison 

H 

• 4.0 

4.2 

0.2 

3.2 

3.4 

0.2 

Jackson 

1 

13.4 

23.2 

9.8 

9.9 

17,2 

7.3 

Lawrence 

H 

3.4 

4.4 

1.0 

2.9 

3.7 

0.8 

Monroe 

H 

1.8 

2.1 

0.3 

1.8 

2.1 

0.3 

Orange 

H 

3.0 

3.3 

0.3 

2.9 

3.1 

0.2 

Perry 

H 

2.7 

3.0 

0.3 

- 2.7 

3.0 

0.3 

Washington 

H 

5.4 

5.7 

0.3 

4.0 

4 . 3 

0 . 3 

Total 


35.8 

48.4 

12.6 - 

3.5 

4.8 

1.3 
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Table 28. (continued) 


Hectares Proportion 

( 000 ) (%) 

County * SRS Reg. Diff . SRS Reg. Diff. 


Southeast District 


Clark H 
Dearborn H 
Franklin 1 
Jefferson H 
Jennings 1 
Ohio H 
Ripley 1 
Scott H 


Switzerland H 
Total 


6.0 

6.3 

0.3 

2.6 

2.4 

- 0.2 

6.8 

11.8 

5.0 

7.4 

6.9 

- 0.5 

10.2 

1 6 0 

5.8 

0.6 

0.6 

0.0 

13.5 

20.9 

7.4 

4.7 

6.3 

1.6 

2.0 

2.1 

0.1 

53.8 

73.3 

19.5 


6.0 

6.3 

0.3 

3.3 

3.0 

- 0.3 

6.7 

11.6 

4.9 

7.8 

7.3 

- 0.5 

10.4 

16.4 

6.0 

2.7 

2.7 

0.0 

11.8 

18.3 

6.5 

9.4 

12.6 

3.2 

3.5 

3.7 

0.2 

7.5 

10.2 

2.7 


*Method of Estimation: H-historical ; 1, 2, and 3 refer to the 
groups defined in Table 26. 
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7.0 SIGNIFICANT RESULTS AND CONCLUSIONS 


The first sections of this report described the rationale 
and background of this research, defined the objectives and 
experimental approach, and presented the results. Many differ- 
ent phases of our investigation have produced results which we 
believe are significant in the development of remote sensing 
technology, particularly for crop surveys. New techniques for 
handling and analyzing multispectral scanner data were devel- 
oped; crops were classified over larger areas than ever before. 
The results conclusively demonstrated the efficiency and appli- 
cability of computer-aided analysis techniques for estimating 
crop areas. The objectives and approach are briefly reviewed 
in this section; then the most significant results and conclu- 
sions are presented. 

The overall objective of the investigation was to develop 

and test techniques utilizing Landsat MSS data to identify and 

determine the areal extent and distribution of crops over large 

geographic areas. The specific objectives were: 

Using Landsat data and computer- implemented 
pattern recognition, classify the major crops 
from regions encompassing different climates, 
soils and crops. 
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Estimate crop areas for county and state size 
regions using identification data obtained 
from Landsat classifications. 

Evaluate the accuracy, precision and timeliness 
of crop estimates obtained from Landsat data. 

The test areas and crops were Kansas, winter wheat, and 
Indiana, corn and soybeans. The major steps of the experimental 
approach used were: 

Use aerial photography as reference data. for 
training the classifier. 

For counties without reference data, extend 
training statistics from adjacent counties 
having similar crops and soils. 

Classify and make area estimates from a 
systematic random sample of pixels distributed 
over an entire county. 

Adjust estimates for classification bias. 

Aggregate county estimates to district' and 
state levels. 

Perform quantitative statistical evaluation 
of results - using the area estimates made by 
USDA/SRS as a. standard of comparison. 

Landsat data acquired during March to June' for the counties 
in seven crop districts of Kansas were classified; estimates of 
the area of wheat in each of the 80 counties were made and 
compared to the corresponding estimates made by the USDA/SRS. 

The correlation of the USDA/SRS county estimates of wheat area 
to the Landsat estimates was 0.80. The wheat proportion esti- 
mates of 49% of the Landsat county estimates were within + 5% 
of the SRS estimates and 81% were within +_ 10%. At. the crop 
reporting district level there was a significant difference in 
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the Landsat and SRS estimates in only one of the seven dis- 
tricts. In that district the differences, although small, -were 
all in one direction. . For the state, the SRS estimate was 
4,555,000 hectares compared to the Landsat estimate of 4,613,000 
hectares, a relative .difference of only 1.27%. 

The coefficient of variation, a measure of the precision 
or sampling error, of the Landsat estimates was 0.06% compared 
to 4% for SRS estimates at the state level. The median coeffi- 
cient of variation of the Landsat county estimates was 0.60%. 

At all' levels, state, district, and county, the Landsat esti-i 
mates were extremely precise compared to the corresponding . 
USDA/SRS estimates. 

Landsat data acquired during July, August, or September 
for 43 counties in four districts were classified for the 
Indiana portion of the study. The corn and soybean classifica- 
tion performances and area estimates were not as accurate as for 
wheat in Kansas. The correlation coefficients for Landsat and 
SRS county estimates of the areas of corn and soybeans were 
0.67 and 0.56, respectively. The corn estimates were consist- 
ently high compared to SRS and the soybean estimates, although 

not '.biased as for corn, varied widely from SRS. There were 
also significant differences in the SRS and Landsat estimates at 
the district and state levels. As in Kansas, the Indiana 
Landsat estimates were very precise compared to the SRS esti- 
mates . 
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The generally lower level of performance in Indiana 
compared to Kansas is attributed to the greater number of crops 
and spectral classes to discriminate among; smaller, less 
homogeneous fields; less optimal timing of Landsat data acqui- 
sition; and less adequate reference or training data. A major 
difference between winter wheat identifi .ation in Kansas and 
corn and soybean identification in Indiana is that the crop 
calendar of winter wheat is different than most other cover 
types; whereas, corn and soybeans, both summer crops, have crop 
calendars similar to other cover types present, (i.e. are green 
at the same time] such as. oats, hay, pasture, and trees. In 
summary, the identification of corn and soybeans in Indiana is 
a much more difficult problem than winter wheat identification 
in Kansas. This fact was compounded by the lack of cloud-free 
Landsat data at critical times and inadequate reference data 
for optimal training of the classifier. 

Results in both Kansas and Indiana could be improved by 
the following changes which can be recommended based on the 
results obtained in this investigation. In the area of strat- 
ification there are two recommendations: first, apply a more 
systematic, objective procedure for subdividing the scene into 
homogeneous areas, and second, use classification units smaller 
than a county when a county falls into more than one stratum. 
Two improvements in the area of data acquisition x^rould be ben- 
eficial: first, coordinate aerial photography acquisition more 
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closely with the crop calendar and Landsat data acquisition; 
second, more timely delivery of Landsat data could be used to 
facilitate scheduling aerial photography missions. Finally, 
the computer costs for classification could be decreased by 
reducing the sampling fraction from 25% to either 6.25 or 4% 
without significantly affecting the accuracy or precision of 
the estimates. 

The overall conclusions of the investigation are: 

Landsat MSS data was adequate to accurately 
identify wheat in Kansas; corn and soybean 
estimates for Indiana were less accurate. 

Computer-aided analysis techniques can be 
effectively used to extract crop identification 
information from Landsat data. 

Systematic sampling of entire counties made 
possible by computer classification methods 
resulted in very precise area estimates at 
county, district, and state levels. 

Training statistics can be successfully 
extended from one county to other counties 
having similar crops and soils if the 
training areas sampled the total variation 
of the area to be classified. 

The synoptic view of Landsat provides the opportunity to 
obtain crop production information over very large areas, e.g. 
states and countries. . By using computer processing techniques 
to classify pixels distributed over entire counties, it is also 
possible to make accurate and precise estimates for local areas, 
e.g. counties. These capabilities combining satellite, sensor, 
and computer make a worldwide, and at the same time, a local 
crop production. information system possible. The procedures and 
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results of this investigation should be of particular interest 
to U.S. government "user" agencies including the .Statistical 
Reporting Service, the Foreign Agricultural Service, and the. 
Economic Research Service; international organizations such 
as the United Nations' Food and Agriculture Organization; and 
private firms such as grain exporting companies. 
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8.0 RECOMMENDATIONS 


The experiences and results of this research with Landsat 
data have indicated a number of recommendations which should be 
considered in designing and implementing future satellite 
sensor/data processing systems. These are as follows: 

Frequency of Data Collection : ■ The 18 day collection 
sequence available with Landsat- 2 proved to be inadequate for 
several phases of this study; although Landsat-1 data was used 
to fill in several gaps in the data, it was not readily avail- 
able. An 8 to 10 day cycle would.be much more satisfactory for 
crop surveys in the future. Because of frequent cloud cover 
problems, such an increase in frequency of coverage would assure 
a higher probability for collection of adequate quantity and 
quality of data during critical periods of the vegetative grow- 
ing season. More frequent coverage than 18 days will also be 
required for monitoring crop conditions. 

Wavelength Bands : Work with aircraft data and more 

recently with Skylab data has clearly shown the importance of 
the middle infrared' and thermal infrared portions of the 
spectrum for crop identification. Because the Landsat scanner 
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does not obtain data in these wavelength regions, we believe 
that the classification accuracies achieved are not as high 
as would be possible. Addition of at least one wavelength 
band in the middle infrared portion- of the spectrum (1.3-2. 6ym) 
and at least one channel in the 8-13. 5ym thermal infrared region 
in future satellite scanner systems will unquestionably allow 
significant improvements in -many of the results obtained, and 
in the utility' of this type- of satellite data. Further, the. 
narrower and more optimally placed visible and nea'r infrared 
bands of the proposed thematic mapper sensor on Landsat D will 
be a substantial improvement [21], 

Spatial Resolution : The 80 meter IFOV of the current 

Landsat MSS appears generally adequate for areas- having rela- 
tively large fields, but it is definitely a limitation in 
working in areas with field sizes of 10 hectares ,or less. The 
30 meter IFOV of the proposed thematic mapper sensor would be a 
major improvement in that it would greatly reduce the proportion 
of "mixed" field boundary -pixels and facilitate .locating field 
boundaries . 

Time of -Day- : To maximize the- signal/noise ratio and 

minimize the effect of 'shadows,' Landsat overpasses’ near solar 
noon would be optimal, -However., because of the normal mid-day 
build-up -of cumulus- clouds., it appears that the time of day 
utilized is nearly ideal:, and a change in. the time of data collec- 
tion ,is not recommended for future systems. 
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Delays in Receipt of Data : Lengthy delays in receipt of 

data in either image or tape format precluded the possibility 
of a rapid analysis of the data and subsequent field checking. 

It is highly recommended that a system be developed to get an 
intermediate quality product into the hands of the investigators 
within 2-4 days after data collection. If cloud cover was 
minimal and overall data quality appeared promising, ,the inves- 
tigator could then request tapes and final image product outputs 
and more intelligently schedule and utilize resources in collect- 
ing ’’ground truth." 

Reference Data for Training : The importance of high 

quality, accurate reference data for training the classifier 
should be emphasized. A multistage sampling system combining 
coordinated ground observations; large scale aerial photography; 
small scale, high’ altitude photography; and Landsat data would 
be ideal and insure the greatest accuracy in the classification 
of Landsat data. However, in most instances one or two of the 
stages are sufficient and as additional knowledge and under- 
standing of the multispectral responses of crops is gained, 
greater dependence can be placed on developing training statis- 
tics directly from the Landsat data. This approach is being 
utilized by LAC IE for wheat and should be developed -for other 
crops and regions. 

Geometric Correction and Multitemporal Registration : 
Although neither geometrically corrected or multitemp or ally 
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registered data were utilized in this investigation because of. 
the current high cost of obtaining such data, both kinds of 
preprocessing should be made routinely available in order to 
increase the utility and performance of Landsat data. In this 
investigation geometrically corrected digital data would have 
considerably simplified the task of obtaining fiefd and county 
coordinates. The ability to register multiple data sets is 
becoming increasingly important in that it allows the temporal 
dimension of the spectral measurements to be fully utilized, 
and will also alloitf satellite data to be effectively related to 
other maps. Future systems should provide a digital data format 
that has been geometrically corrected to a standard format base 
to facilitate data registration. 

Data Analysis Techniques : Improvements in data analysis 

techniques are required to fully achieve the potential infor- 
mation content of-multitemporal , spectral measurements acquired 
from space. The spatial dimension has been little used to 
date in computer-aided' data analysis, although spatial char- 
acteristics are known to, bear a great amount of information and 
are regularly used by photo interpreters. Still another aspect 
of satellite data analysis is the need to develop' methods for 
effectively working over the large geographic areas for which 
Landsat data is; obtained. The diversity of landscape patterns 
found over many areas of this size indicates that a logical 
first step in, the classification of Landsat data is to stratify 
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or divide the scene into units which are internally similar. 
Such a stratification will be helpful in constructing sampling 
frames which minimize the variance among sample units and in 
determining the boundaries of areas over which training statis- 
tics can be satisfactorily extended. 

Crop Yield Prediction : Although yield prediction or crop 

assessment was not an objective or within the scope of this 
investigation, there were indications as we analyzed the data 
that some of the observed variations in spectral response were 
due to- factors which are related to yield such as. amount of 
tillering, leaf area, and biomass. These relationships as well 
as the use of Landsat data to determine the extent and severity 
of catastrophic events such as drought should be explored in 
future studies. 

In closing, we believe considerable progress toward an 

operational, crop survey system was made as a result of this 

\ 

experiment. The~results conclusively demonstrated the effi- 
ciency and applicability of computer-aided analysis techniques 
for estimating crop areas. Many of the techniques used in the 
investigation could be transferred to an operational system 
capable of producing accurate and precise crop area estimates 
for local areas such as counties, as well as for larger areas 
such as states or countries. 
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Table Al, Summary of Landsat scenes and sources of training 
statistics used for classifications in Kansas. 


Source of 

Training Landsat' 

County Statistics Scene Date 


Northwest District 


Cheyenne 

(local) 

2165-16450 

July 

6, 

1975 

Decatur 

Norton 

2146-16392 

June 

17, 

1975 

Graham 

(local) 

2146-16395 

June 

17, 

1975 

Norton 

(local) 

2146-16392 

June 

17, 

197 5 

Rawlins 

Cheyenne 

2165-16450- 

July 

6, 

1975 

Sheridan 

Trego 

2146-16395 

June 

17, 

1975 

Sherman 

(local) 

2165-16453 

June 

7, 

1975 

Thomas 

Sherman 

2165-16453 

June 

7, 

1975 

North Central District 

Clay 

Ottawa 

2144-16282 

June 

15, 

1975 

Cloud 

(local) 

2163-16334 

July 

4 , 

1975 

Jewell 

Smith 

2163-16334 

July 

4, 

1975 

Mitchell 

Osborne 

2163-16340 

July 

4, 

1975 

Osborne 

(local) 

2163-16340 

July 

4, 

1975 

Ottawa 

(local) 

2144-16282 

June 

15, 

1975 

Phillips 

Norton 

2146-16392 

June 

17, 

1975 

Republic 

Cloud 

2163-16334 

July 

4 , 

1975 

Rooks 

Graham 

2146-16395 

June 

17, 

1975 

Smith- 

(local) 

2163-16334 

July 

4 , 

1975 

Washington 

Cloud 

2163-16334 

July 

4, 

1975 

West Central District 
Gove 

Trego 

2146-16395 

June 

17, 

1975 

Greeley 

(local) 

2165-16453 

July 

6, 

1975 

Lane 

Trego 

2146-16395 

June 

17, 

1975 

Logan 

Wallace 

2165-16453 

July 

6, 

1975 

Ness 

(local) 

2146-16395 

June 

17, 

1975 

Scott 

Greeley 

2165-16453 

July 

6, 

1975 

Trego 

(local) 

2146-16395 

June 

17, 

1975 

Wallace 

(local) 

2165-16453 

July 

6, 

197-5 

Wichita 

Greeley 

216-5-16453 

July 

6, 

1975 

.Central District 

Barton 

(local) 

2163-16340 

July 

4 , 

1975 

Dickinson 

Saline 

2144-16282 

June 

15, 

1975 

Ellis 

Trego 

2146-16395 

June 

17, 

1975 
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Table A1 • (continued) 


Central District (cont.) 


Ellsworth 

Rus sell 

.2163-16340 

July 

4, 1975 

Lincoln 

Russell 

2163-16340 

July 

4, 1975 

McPherson 

(local) 

2144-16282 

June 

15, 1975 

Marion 

McPherson 

2144-16282 

June 

15, 1975 

Rice 

Barton 

2163-16340 

July 

4, 1975 

Rush 

Trego 

2146-16395 

June 

17, 1975 

Russell 

(local) 

2163-16340 

July 

4, 1 9 7*5 

Saline 

(local) 

2144-16282 

June 

15, 1975 

Southwest District 

Clark 

Ford 

5032-16310 

May 

21, 1975 

Finney 

(local). 

5032-16310 

May 

21, 1975 

Ford 

(local) 

5032-16310 

May 

21, 1975 

Grant 

Hamilton 

2147-16460 

June 

18, 1975 

Gray 

Haskell 

5032-16310 

May 

21, 1975 

Hamilton 

(local) 

2147-16460 

June 

18, 1975 

Haskell 

(local) 

5032-16310 

May 

21, 1975 

Hodgeman 

(local) 

2146-16395 

June 

17, 1975 

Kearney 

Hamilton 

2147-16460 

June 

18, 1975 

Meade 

Ford 

5032-16310 

May 

21, 1975 

Morton 

Stanton 

2147-16460 

June 

18, 1975 

Seward 

.(local) 

5032-16310 

May 

21, 1975 

Stanton 

(local) 

2147-16460 

June 

18, 1975 

Stevens 

Hamilton 

2147-16460 

June 

18, 1975 

South Central District 
Barber 

(local) 

2073-16342 

April 

5, 1975 

Barber 

(local) 

2109-16341 

May 

11, 1975 

Comanche 

Pratt 

2073-16342 

April 

5, 1975 

Comanche 

Pratt 

2109-16341 

May 

11, 1975 

Edwards 

Pratt 

2073-16342 

April 

5, 1975 

Edwards 

Pratt 

2109-16341 

May 

11, 1975 

Harper 

Sumner 

2072-16284 

April 

4, 1975 

Harper 

Sumner 

2144-16284 

June 

15, 1975 

Harvey 

(local) 

2072-16284 

April 

4, 1975 

Harvey 

(local) 

2144-16284 

June' 

15, 1975 

Kingman 

Pratt 

2073-16342 

April 

5, 1975 

Kingman 

Pratt 

2109-16341 

May 

11, 197-5 

Kiowa 

Pratt 

2073-16342 

April 

5, 1975 

Kiowa 

Pratt 

2109-16341 

May 

11, 1975 

Pawnee 

Stafford 

2073-16342 

April 

5, 1975 

Pratt 

(local) 

2073-16342 

April 

5, 1975 

Pratt 

(local) 

2109-16341 

May 

11, 1975 

Reno 

Stafford 

2073-16342 

April 

5, 1975 

Sedgwick 

Sumner 

2072-16284 

April 

4, 1975 


BEPROPUCIBILITY of the 
PAGE IS POOR 
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Table A1 . (continued) 


South Central District (cont.) 


Sedgwick 

Sumner 

2144-16284 

June 

15, 

1975 

Stafford 

(local) 

2073-16342 

April 

5, 

1975 

Sumner 

(local) 

2072-16284 

April 

4, 

1975 

Sumner 

(local) 

2144-16284 

June 

15, 

1975 

Southeast District 

Allen 

(local) 

2142-16171 

June 

13, 

1975 

Allen 

(local) 

2107-16225 

May 

9 , 

1975 

Bourbon 

Allen 

2142-16171 

June 

13, 

1975 

Butler 

Harvey 

2144-16284 

June 

15, 

1975 

Chautauqua 

Allen 

2107-16225. 

May 

9 , 

1975 

Cherokee 

Allen 

2142-16171 

June 

13, 

1975 

Cowley 

Sumner 

2144-16284 

June 

15, 

1975 

Crawford 

Allen 

2142-16171 

June 

13, 

1975 

Elk 

Allen 

2107-16225 

May 

9 , 

1975 

Greenwood 

Allen 

2107-16225 

May 

9 , 

1975 

Labette 

Allen 

2142-16171 

June 

13, 

1975 

Montgomery 

Allen 

2107-16225 

May 

9 , 

1975 

Neosho 

Allen 

2142-16171 

June 

13, 

1975 

Wilson 

Allen 

2107-16225 

May 

9 , 

197-5 

Woodson 

Allen 

2107-16225 

May 

9 , 

1975 
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Table A2. Summary of Landsat scenes and sources of training 
statistics used for classification in Indiana. 


Source of 

Training Landsat 

County Statistics Scene - Date 


Northwest District 


Benton 

■ (local) 

2228-15522 

Sept. 

7, 

1975 

Jasper 

Newton 

2228-15515 

Sept. 

7, 

1975 

Lake 

(local) 

2228-15515 

Sept, 

7, 

1975 

LaPorte 

(local) 

2228-15515 

Sept , 

7, 

1975 

Newton 

(local) 

2228-15515 

Sept . 

7, 

1975 

Porter 

Lake 

2228-15515 

Sept. 

7, 

1975 

Pulaski 

(local) 

2228-15515 

Sept. 

7, 

1975 

Starke 

(local) 

2228-15515 

Sept. 

7, 

1975 

White 

(local) 

2228-15522 

Sept. 

7, 

1975 

West Central District 

Clay 

Vigo 

2173-15480 

July 

14, 

1975 

Fountain 

(local) 

2228-15522 

Sept . 

7, 

1975 

Montgomery 

(local) 

2209-15464 

Aug. 

19, 

1975 

Owen 

(local) 

2173-15480 

July 

14, 

1975 

Parke 

(local) 

2228-15522 

Sept . 

7, 

1975 

Putnam 

Owen 

2173-15480 

July 

14, 

1975 

Tippecanoe 

(local) 

2228-15522 

Sept . 

7, 

1975 

Vermillion 

Parke 

2228-15522 

Sept . 

7, 

1975 

Vigo 

(local) 

2173-15480 

July 

14, 

1975 

Warren 

(local) 

2228-15522 

Sept . 

7, 

1975 

Central District 

Bartholomew 

Decatur 

2208-15412 

Aug. 

18, 

1975 

Boone 

Hamilton 

2209-15464 

Aug. 

19, 

1975 

Clinton 

Tipton ' 

2209-15464 

Aug. 

19, 

1975 

Decatur 

(local) 

2208-15412 

Aug. 

18, 

1975 

Grant 

(local) 

2209-15464 

Aug. 

19, 

1975 

Hamilton 

(local) 

2209-15464 

Aug. 

19, 

1975 

Hancock 

(local) 

2208-15405 

Aug. 

18, 

1975 

Hendricks 

Hamilton 

2209-15464 

Aug. 

19, 

1975 

Howard 

(local) 

2209-15464 

Aug. 

19, 

1975 

Johnson 

(local) 

2208-15412 

Aug. 

18, 

1975 

Madison 

(local) 

2208-15405 

Aug. 

18, 

1975 

Marion 

Hamilton 

2209-15464 

Aug. 

19, 

1975 

Morgan 

Owen 

2173-15480 

July 

14, 

1975 

Rush 

Shelby 

2208-15412 

Aug. 

.18, 

1975 

Shelby 

(local) 

2208-15412 

Aug. 

18, 

1975 

Tipton 

(local) 

2209-15464 

Aug. 

19, 

1975 
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Table A2. (continued) 


East Central District 


Blackford 

Jay 

2208-15405 

Aug. 

18, 

1975 

Delaware 

Randolph 

2208-15405 

Aug. 

- 18, 

1975 

Fayette 

(local) 

2208-15412 

Aug. 

18, 

1975 

Henry 

Wayne 

2208-15405 

Aug.. 

18, 

1975 

Jay 

(local) 

2208-15405 

Aug. 

18, 

1975 

Randolph 

(local) 

2208-15405 

Aug. 

18, 

1975 

Union 

Fayette 

2208-15412 

Aug. 

18, 

1975 

Wayne 

(local) 

2208-15405 

Aug. 

18, 

1975 
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